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PROTEIN HAVING PDZ DOMAIN SEQUENCE 

This is a continuation-in-part of PCT/JP98/03603 
filed August 12, 1998, and claims priority from Japanese 
Patent Application Nos. 9/230356, filed August 12, 1997, 
and 10/189944, filed June 18, 1998. 

Technical Field 
The present invention relates to novel proteins having 
the PDZ domain sequence and also to gene encoding the 
proteins . 

Background Art 
Proteins such as PSD-95, hDlg, ZO-1, p55, Dsh, LIN-7, 
InaD, and PTPLl/FAPl are known to possess the PDZ domain 
and are called the PDZ family. A structure having 
approximately 80 to 90 amino acid residues, repeated three 
times and each containing a conserved "Gly-Leu-Gly-Phe 
(GLGF) " 4 amino acid motif (Neuron 9:929-942 (1992)), was 
initially identified in the 95 KDa post-synaptic density 
protein, PSD-95. The same domain structure was later found 
in the Drosophila lethal (1) discs large-1 tumor suppressor 
protein, DlgA (Cell 66:451-464 (1991)), and in the tight 
junction protein, ZO-1 (J. Cell Biol. 121:491-502 (1993)). 
The repeat sequence was therefore named the "PDZ domain" by 
combining the initials of PSD-95, DlgA, and ZO-1. (It is 
also called the "GLGF repeat" or "DHR (DlgA homology 
region) domain.") A protein having the PDZ domain is known 
to bind to other proteins by means of the sequence of this 
PDZ domain. For example, the PSD-95 protein is known to 
bind to the NMDA receptor 2B (Kornau, H. C, et al., 
Science 269:1737-1740 (1995)) and the Shaker-type K + channel 
(Kim, E . , et al., Nature 378:85-88 (1995)). The hDlg 
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protein has been reported to bind directly to the protein 
encoded by the adenomatous polyposis coli tumor suppressor 
gene/APC (Matsumine et al., Science 272:1020-1023 (1996)), 
and the Dsh protein has been reported to bind directly to 
the Notch protein (Axelrod, J. D., et al . , Science 
271:1826-1832 (1996)). Furthermore, the InaD protein has 
been reported to bind to a Ca 2+ channel protein, TRP, that 
functions in the Drosophila visual signal transduction 
cascade (Shieh, B. H. and Zhu, M. Y., Neuron 16:991-998 
(1996)). The structure of proteins having the PDZ domain 
varies because some of the proteins contain only one domain 
(p55 and Dsh), while others contain two (SIP-1: Yanagisawa, 
J., et al., J. Biol. Chem. 272:7167-7172 (1997)), three 
(PSD-95 and hDlg) , five (InaD and PTPL1/FAP1), seven (GRIP: 
Dong, H., et al., Nature 386:279-284 (1997)), or thirteen 
(Ullmer, C, et al., FEBS Letters 424:63-68 (1998)). Also 
a recently reported mouse gene lacks a region encoding an 
N-terminal peptide of the protein, but which encodes a 
peptide having four PDZ domains within this incomplete 
genetic region (Recorded to GenBank on May 18, 1997; 
accession number AF000168) . Although there are a few 
exceptions, proteins having the PDZ domain are known to 
bind to other proteins that have a hydrophobic amino acid 
region consisting of three amino acids represented by 
"Thr/Ser-Xaa-Val" (Xaa being an arbitrary amino acid 
residue) at their C-terminus. Most of these proteins are 
transmembrane proteins and are presumed to function in 
signal transduction within the cell (TIBS 21:455-458 
(1996), Yanagisawa, J., et al., J. Biol. Chem. 272:7167- 
7172 (1997)). 

Since the above proteins having the PDZ domain and 
proteins that interact with these proteins are involved in 
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neural transmission, apoptosis, and malignant conversion, 
they have recently drawn attention as targets for 
developing pharmaceuticals. 

Disclosure of the Invention 
An objective of the present invention is to provide a 
novel protein having the PDZ domain sequence and a DNA 
encoding the protein. Another objective of the present 
invention is to provide a vector containing the DNA, a 
transformant harboring the DNA in which the DNA can be 
expressed, and a method of producing the recombinant 
protein utilizing the transformant. A further objective of 
the present invention is to provide an antisense DNA 
against the DNA and antibody that binds to the protein. 
Still another objective of the present invention is to 
provide a screening method for proteins that bind to the 
PDZ-domain protein. 

While analyzing the changes of gene expression in 
human umbilical vascular endothelial cells by TNFa, the 
present inventors isolated a gene whose expression was 
increased by TNFa stimulation. Screening was performed 
using the gene as a probe, and, as a result, a gene 
encoding novel proteins was isolated. The present 
inventors analyzed the structure of the proteins encoded by 
the isolated gene and found that the proteins contain 
within the molecule the PDZ domain sequence that plays an 
important role in the interactions with other proteins 
involved in neural transmission, apoptosis, and malignant 
conversion. The present inventors also found that the 
single gene produces at least five different 
transcriptional products through the differences in 
transcription initiation sites and in splicing. 



Attorney Docket: 06501/056001 



The present inventors succeeded in preparing the 
proteins encoded by the gene as recombinant proteins by 
incorporating the isolated gene into an expression vector, 
and by transfecting it into E. coli cells and culturing the 
5 cells. In addition, by immunizing rabbits with the 

proteins thus prepared, the present inventors succeeded in 
preparing antibodies that bind to the proteins. 

The present invention relates to a group of novel 
proteins having the PDZ domain sequence within the molecule 
10 and to their gene, and more specifically, to 

(1) a protein comprising the amino acid sequence 
described in SEQ ID NOs: 1, 2, 82, 83, or 84; 

(2) a protein comprising the amino acid sequence 
described in SEQ ID NOs: 1, 2, 82, 83, or 84, in which one 

15 or more amino acids have been substituted, deleted, and/or 
added, and having affinities to other proteins 
characteristic to the PDZ domain; 

(3) a fusion protein comprising the protein described 
in (1) or (2) and a protein or a peptide containing at 

20 least one antibody recognition site; 

(4) a DNA encoding the protein of any one of (1) 
through (3) ; 

(5) an antisense DNA against the DNA or a part 
thereof whose nucleotide sequence is described in SEQ ID 

25 NO: 2; 

(6) a vector containing the DNA of (4); 

(7) a transformant harboring the DNA of (4), in which 
the DNA can be expressed; 

(8) a method of producing the protein of any one of 
30 (1) through (3), comprising the process of culturing the 

transf ormants described in (7); 
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(9) a screening method for proteins that bind to the 
protein of (1) or (2), comprising the process of selecting 
the proteins that bind to the proteins by contacting sample 
proteins with the proteins of any one of (1) through (3) ; 

(10) a screening method for genes encoding the 
proteins that bind to the proteins of (1) or (2), 
comprising the process of selecting the genes corresponding 
to the gene products that bind to the proteins of (1) or 
(2) by contacting the gene products of the sample genes 
with the protein of (1) or (2); 

(11) a protein that binds to the protein of (1) or 

(2); 

(12) the protein of (11) that can be isolated by the 

method of (9) ; 

(13) a gene encoding a protein that bind to the 

protein of (1) or (2) ; 

(14) the gene of (13) that can be isolated by the 
method of (10) ; and 

(15) an antibody that bind to the protein of (1) or 

(2) . 

In the present invention, the "PDZ domain sequence" 
refers to a sequence having 80 to 90 amino acids, 
containing the four amino acid motif that consists of 
"Gly-Leu-Gly-Phe" or similar amino acids (cf. TIBS 20:102- 
103 (1995) ) . 

The present invention relates to novel proteins having 
the PDZ domain sequence. Although there are a few 
exceptions, proteins having the PDZ domain are known to 
interact with other proteins that have a hydrophobic amino 
acid region at their C-terminal ends. The other proteins 
are transmembrane proteins and are presumed to function in 
signal transduction within the cell (TIBS 21: 455-458 
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(1996), Yanagisawa, J., et al., J. Biol. Chem. 272:7167- 
7172 (1997)). 

The present inventors have discovered five different 
transcription products among those that encode proteins 
having the PDZ domain. These products are thought to arise 
from a single gene through differences in transcription 
initiation sites and in splicing. The amino acid sequences 
of the proteins encoded by these transcription products are 
shown in SEQ ID NOs: 1, 2, 82, 83, and 84. 

The protein having the amino acid sequence described 
in SEQ ID NO: 1, which is included in the proteins of the 
present invention, possesses nine PDZ domains that 
correspond to amino acid positions 69 to 158 (SEQ ID 
NO: 4), positions 371 to 461 (SEQ ID NO: 5), positions 520 
to 615 (SEQ ID NO: 6), positions 649 to 734 (SEQ ID NO: 7), 
positions 782 to 865 (SEQ ID NO: 8), positions 928 to 1013 
(SEQ ID NO: 9), positions 1024 to 1108 (SEQ ID NO: 10), 
positions 1161 to 1249 (SEQ ID NO: 11), and positions 1286 
to 1373 (SEQ ID NO: 12) (see Figure 8). 

Similarly, the protein having the amino acid sequence 
described in SEQ ID NO: 2, which is also included in the 
proteins of the present invention, corresponds to amino 
acids 369 to 1373 of the sequence described in SEQ ID 
NO: 1. The difference between the structures of these 
proteins is considered to arise from the difference in the 
mRNA transcription initiation sites. 

The protein described in SEQ ID NO: 2 possesses a 
total of eight PDZ domain sequences, corresponding to amino 
acids 3 to 93, 152 to 247, 281 to 366, 414 to 497, 560 to 
645, 656 to 740, 793 to 881, and 918 to 1005. However, it 
does not possess the first PDZ domain found in the protein 
described in SEQ ID NO: 1. Although its biological 
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significance is not clear, considering the specific 
expression of the mRNA corresponding to the protein 
described in SEQ ID NO: 2 in the liver (Example 5) and the 
fact that the PDZ domain plays an important role in 
protein-protein interactions, the protein described in SEQ 
ID NO: 2, by lacking this domain, may be involved in 
controlling the signal in the liver differently from the 

other tissues. 

The protein having the amino acid sequence described 
in SEQ ID NO: 82 (the 32-8-la protein) , which is also 
included in the proteins of the present invention, consists 
of 2,000 amino acids. These amino acids are predicted by 
combining sequences of two cDNAs. One cDNA was discovered 
in the search for a cDNA derived from the human brain and 
contains a 5' upstream region of a cDNA encoding a protein 
having the amino acid sequence described in SEQ ID NO: 1. 
The other cDNA (SEQ ID NO: 3) encodes the protein having 
the amino acid sequence described in SEQ ID NO: 1. The 
32-8-la protein possesses a total of 13 PDZ domain 
sequences, corresponding to amino acids 133 to 222, 253 to 
335, 373 to 461, 549 to 632, 696 to 784, 1004 to 1087, 1147 
to 1240, 1276 to 1361, 1409 to 1492, 1555 to 1640, 1651 to 
1735, 1788 to 1870, and 1913 to 2000 (Figure 25). 

Similarly, the amino acid sequences of the proteins 
encoded by the two splicing variants that are thought to 
result from the different splicing from the transcription 
product encoding the 32-8-la protein and are also included 
in the proteins of the present invention, are shown in SEQ 
ID NO: 83 (the 32-8-lb protein) and in SEQ ID NO: 84 (the 
32-8-lc protein) . The 32-8-lb protein, similar to the 32- 
8-la protein, consists of 2,070 amino acids, possessing 13 
PDZ domains. The PDZ domains of the 32-8-lb protein exist 



Attorney Docket: 06501/056001 



at positions 133 to 222, 253 to 335, 373 to 461, 549 to 
632, 696 to 784, 1004 to 1087, 1147 to 1241, 1346 to 1431, 
1479 to 1562, 1625 to 1710, 1721 to 1805, 1858 to 1946, and 
1983 to 2070 of its amino acid sequence. 
5 In contrast, the 32-8-lc protein has a shorter chain 

length than 32-8-la or 32-8-lb because of the termination 
codon created by the splicing, and consists of 1,239 amino 
acids, possessing seven PDZ domains. The PDZ domains of 
the 32-8-lc protein exist at positions 133 to 222, 253 to 

10 335, 373 to 461, 549 to 632, 696 to 784, 1004 to 1087, and 
1147 to 1239 of its amino acid sequence. 

It is clinically very significant that these proteins 
of the present invention are all of human origin, as 
opposed to being derived from other animals. In 

15 particular, proteins derived from other organisms (e.g., 

mice or rats) cause serious side effects such as reduction 
or loss of therapeutic effects by generating antibodies or 
by inducing serum sickness and anaphylactic shock, due to 
the immunogenicity when they are used to treat humans. 

20 Therefore, it is desirable to use proteins of human origin 
as therapeutic materials for humans. 

The proteins of the present invention can be prepared 
from natural proteins, but they can also be prepared as 
recombinant proteins using recombinant genetics technology. 

25 The natural proteins can be isolated from such sources as 
the human umbilical vascular endothelial cells (HUVEC) by 
means of methods well-known to persons skilled in the art. 
For example, they can be isolated as described below, with 
an affinity column in which an antibody against the protein 

30 of the present invention has been bound to an appropriate 
support. The affinity column can be constructed, for 
example, according to the method described by Wilchek et 
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al. (Wilchek et al., Methods Enzymol. 104:3-55 (1984)). 
Furthermore, the recombinant protein can be prepared by 
culturing the cells transformed with the DNA encoding the 
protein of the present invention, as will be described 
later. 

The proteins of the present invention also include 
functional derivatives of the proteins having the amino 
acid sequences described in SEQ ID NOs: 1, 2, 82, 83, and 
84. A "functional derivative" means a protein that differs 
from the amino acid sequences described in SEQ ID NOs: 1, 
2, 82, 83, and 84 by one or more amino acid residues 
through substitution, deletion, or addition, but that still 
maintains the affinity to the other proteins characteristic 
of the PDZ domain. This affinity normally arises from the 
affinity to a hydrophobic amino acid region that exists in 
the C-terminal ends of the other proteins. The hydrophobic 
amino acid region contains a hydrophobic amino acid motif 
represented by "Thr/Ser-Xaa-Val" (Xaa being an arbitrary 
amino acid residue) (cf. Science 269:1737 (1995), Nature 
378:85 (1995), Science 277:1511 (1997), Neuron 20:693 
(1998), Oncogene 16:643 (1998), J. Biol. Chem. 273:1591 
(1998), Science 272:1020 (1996), Proc. Natl. Acad. Sci. USA 
94:6670 (1997), Proc. Natl. Acad. Sci. USA 94:11612 (1997), 
J. Neurosci. 18:128 (1998), J. Neurosci. 16:7407 (1996), 
Nature Biotech. 15:336 (1997), FEBS Letters 409:53 (1997), 
Nature 386:284 (1997), Nature 386:279 (1997), Nature 
Structure Biol. 5:19 (1998), J. Neurosci. 16:24 (1996), J. 
Biol. Chem. 272:24191 (1997), Science 271:1826 (1996), TIBS 
21:455 (1996), Cell 85:195 (1996), Neuron 18:95 (1997), 
Proc. Natl. Acad. Sci. USA 94:12682 (1997), J. Biol. Chem. 
272:8539 (1997), J. Biol. Chem. 272:24333 (1997), J. Biol. 
Chem. 272:7167 (1997), Proc. Natl. Acad. Sci. USA 94:13683 
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(1997), Nature 392:6676 (1998), J. Biol. Chem. 272:32019 
(1997), Mol. Biol- Cell 9:671 (1998)). 

Functional derivatives occur naturally or can be 
produced artificially; both of these are included in the 
present invention. Methods to alter amino acids, which are 
well known to persons skilled in the art, include the 
methods developed by Kunkel et al. (Methods Enzymol. 
85:2763-2766 (1988)) and those that utilize polymerase 
chain reaction (PCR) . In the Kunkel method, uracil is 
incorporated by using dut" or ung" E. coli as a host when 
preparing the single-stranded DNA to be used as the 
template. Primers containing the desired mutations are 
annealed to this template containing uracil, and ordinary 
DNA synthesis is performed in vitro. When the double- 
stranded DNA thus produced with the uracil-containing DNA 
is introduced into ordinary E. coli cells, the uracil- 
containing DNA strand becomes degraded, and DNA synthesis 
proceeds with the mutated DNA strand as the template. As a 
result, DNA into which mutations have been introduced can 
be obtained with a very high efficiency. An example of the 
methods of introducing mutations using PCR follows. Two 
sets of primers are prepared. One of the primers in each 
set encompasses the region into which the mutation will be 
introduced, and the other contains a restriction enzyme 
recognition site or a sequence just outside of it. A 
region containing appropriate restriction enzyme sites is 
thus targeted. PCR reactions are then performed with the 
two sets of primers. After the products of the two PCR 
reactions are mixed, the DNA is amplified using primers 
having sequences corresponding to the recognition sites of 
the two restriction enzymes or the sequences just outside 
of them. The product is next digested with appropriate 
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restriction enzymes so that the resultant fragment contains 
the region into which the mutation has been introduced. 
The fragment thus obtained is substituted for the said 
region in the original DNA (Saiki et al., Science 239:487- 
491 (1988), Current Protocols in Molecular Biology, Greene 
Publishing Associates and Wiley - Interscience Publishing, 
Unit 8.5.1-8.5.10 (1997), Supplement to Jikken Igaku 
(Experimental Medicine) "Shin Idenshi Kogaku Handbook (New 
Genetic Engineering Handbook)/' Yohdosha, pp251-261) . The 
desired number of amino acids to be substituted in a 
functional derivative is generally 10 or less, more 
preferably 6 or less, and still more preferably 3 or less. 

The term "substantially pure" as used herein in 
reference to a given polypeptide means that the polypeptide 
is substantially free from other biological macromolecules . 
The substantially pure polypeptide is at least 75% (e.g., 
at least 80, 85, 95, or 99%) pure by dry weight. Purity 
can be measured by any appropriate standard method, for 
example, by column chromatography, polyacrylamide gel 
electrophoresis, or HPLC analysis. 

A "conservative amino acid substitution" is one in 
which an amino acid residue is replaced with another 
residue having a chemically similar side chain. Families 
of amino acid residues having similar side chains have been 
defined in the art. These families include amino acids 
with basic side chains (e.g., lysine, arginine, histidine) , 
acidic side chains (e.g., aspartic acid, glutamic acid), 
uncharged polar side chains (e.g., glycine, asparagine, 
glutamine, serine, threonine, tyrosine, cysteine), nonpolar 
side chains (e.g., alanine, valine, leucine, isoleucine, 
proline, phenylalanine, methionine, tryptophan) , beta- 
branched side chains (e.g., threonine, valine, isoleucine) 
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and aromatic side chains (e.g., tyrosine, phenylalanine, 
tryptophan, histidine) . 

The present invention also relates to the DNA encoding 
the proteins of the present invention described above. The 
DNA encoding the proteins of the present invention can be 
cDNA, genomic DNA, or synthetic DNA. The DNA of the 
present invention can be used, for example, to produce the 
proteins of the present invention as recombinant proteins. 
More specifically, the proteins of the present invention 
can be prepared as recombinant proteins by inserting the 
DNA encoding the proteins of the present invention into 
appropriate expression vectors, culturing the transf ormants 
obtained by introducing the said vectors into appropriate 
cells, and purifying the expressed proteins. 

By hybridization under "stringent conditions" is meant 
hybridization at 37 °C, 1 X SSC, followed by washing at 
42°C, 0.5 X SSC. 

The "percent identity" of two amino acid sequences or 
of two nucleic acids is determined using the algorithm of 
Karlin and Altschul (Proc. Natl. Acad. Sci. USA 87:2264- 
2268, 1990), modified as in Karlin and Altschul (Proc. 
Natl. Acad. Sci. USA 90:5873-5877, 1993). Such an 
algorithm is incorporated into the N BLAST and XBLAST 
programs of Altschul et al. (J. Mol. Biol. 215:403-410, 
1990) . BLAST nucleotide searches are performed with the 
NBLAST program, score = 100, wordlength = 12. BLAST 
protein searches are performed with the XBLAST program, 
score = 50, wordlength = 3. Where gaps exist between two 
sequences, Gapped BLAST is utilized as described in 
Altschul et al. (Nucleic Acids Res. 25:3389-3402, 1997). 
When utilizing BLAST and Gapped BLAST programs, the default 
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parameters of the respective programs (e.g., XBLAST and 
NBLAST) are used. See http://www.ncbi.nlm.nih.gov. 

An "isolated nucleic acid" is a nucleic acid the 
structure of which is not identical to that of any 
naturally occurring nucleic acid or to that of any fragment 
of a naturally occurring genomic nucleic acid spanning more 
than three separate genes. The term therefore covers, for 
example, (a) a DNA which has the sequence of part of a 
naturally occurring genomic DNA molecule but is not flanked 
by both of the coding sequences that flank that part of the 
molecule in the genome of the organism in which it 
naturally occurs; (b) a nucleic acid incorporated into a 
vector or into the genomic DNA of a prokaryote or eukaryote 
in a manner such that the resulting molecule is not 
identical to any naturally occurring vector or genomic DNA; 
(c) a separate molecule such as a cDNA, a genomic fragment, 
a fragment produced by polymerase chain reaction (PCR) , or 
a restriction fragment; and (d) a recombinant nucleotide 
sequence that is part of a hybrid gene, i.e., a gene 
encoding a fusion protein. Specifically excluded from this 
definition are nucleic acids present in mixtures of 
different (i) DNA molecules, (ii) transfected cells, or 
(iii) cell clones: e.g., as these occur in a DNA library 
such as a cDNA or genomic DNA library. 

The cells to be used for producing the recombinant 
proteins include, but are not limited to, animal cells such 
as Chinese hamster ovary (CH0) cells, COS cells (a cell 
line obtained by transforming monkey CV-1 fibroblasts by 
the SV40 virus lacking the replication origin) , mouse 
NIH3T3 cells, human HeLa cells, and human lymphoid Namalva 
cells (Current Protocols in Molecular Biology, Greene 
Publishing Associates and Wiley - Interscience Publishing, 
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Unit 16.12-16.14 (1991)). As the vectors, pSV2neo, pcDNAI, 
pCD8, pRcRSV, pREP4, pCEP4 (Invitrogen) , pMAM, pMAMneo 
(Clontech) , pCI-neo mammalian expression vector, pSI-neo 
mammalian expression vector, pTARGET™ mammalian expression 
vector (Promega), and the like can be used. Both plasmid 
vectors and recombinant viruses can be constructed for 
producing the recombinant protein. Recombinant 
adenoviruses using the pAdex vector (Supplement to Jikken 
Igaku (Experimental Medicine) "Shin Idenshi Kogaku Handbook 
(New Genetic Engineering Handbook)," Yohdosha, pp238-244), 
the LN and LXSN vector series, the pBabe vector series (a 
modified version of the preceding series), recombinant 
retroviruses using such vectors as the MFG vectors 
(Supplement to Jikken Igaku (Experimental Medicine) "Shin 
Idenshi Kogaku Handbook (New Genetic Engineering 
Handbook)," Yohdosha, pp245»250, Current Protocols in 
Molecular Biology, Greene Publishing Associates and Wiley - 
Interscience Publishing, Unit 9.10.1-9.14.3 (1992)), 
Sindbis viruses, and vaccinia viruses (Current Protocols in 
Molecular Biology, Greene Publishing Associates and 
Wiley - Interscience Publishing, Unit 16.15.1-16.19.9 
(1992)) can also be used to produce the recombinant 
proteins. It is also possible to produce the recombinant 
proteins by utilizing baculoviruses, and silkworm larvae. 
Alternatively, cultured cell lines such as SF21, SF9, and 
High Five™ cells can be used as the host (Supplement to 
Jikken Igaku (Experimental Medicine) "Bio Manual Series 7: 
Bunshi Seibutsu Kenkyu No Tame No Tanpakushitsu Jikken Hou 
(Protein Experimentation Methods for Molecular Biology 
Research)," Yohdosha, ppl67-171 (1994), OReilly, D. R. et 
al., "Baculovirus Expression Vectors, A Laboratory Manual," 
Oxford University Press (1992)). As the baculovirus 
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expression vectors, pBacPAK8, 9, pBacPAK-His 1/2/3, pAcUW31 
(Clontech), pBlueBac (Invitrogen) , pBAC, pBACgus (Novagen) , 
etc. , can be used. 

The promoters utilized to express the proteins 
efficiently in animal cells include, for example, the SV40 
early promoter (Rigby In Williamson (ed.), Genetic 
Engineering, Vol. 3, Academic Press, London, pp83-141 
(1982)), the EF-la promoter (Kim et al., Gene 91:217-223 

(1990) ), the CAG promoter (Niwa et al., Gene 108:193-200 

(1991) ), the RSV LTR promoter (Cullen, Methods in 
Enzymology 152:684-704 (1987)), the SRa promoter (Takabe et 
al., Mol. Cell. Biol. 8:466 (1988)), the CMV early promoter 
(Seed and Aruffo, Proc. Natl. Acad. Sci. USA 84:3365-3369 
(1987)), the SV40 late promoter (Gheysen and Fiers, J. Mol. 
Appl. Genet. 1:385-394 (1982)), the Adenovirus late 
promoter (Kaufman et al., Mol. Cell. Biol. 9:946 (1989)), 
the HSV TK promoter, and inducible expression promoters. 
The MMTV promoter induced by glucocorticoids, the MT 
(metallothionein) II promoter induced by phorbol esters or 
heavy metals, the Tet-On/Off system that can be turned on 
and off by tetracycline (Clontech) , the expression system 
that can be induced by ecdysone (Invitrogen) , and the Lac 
Switch expression system induced by IPTG are preferred 
examples of the inducible expression promoters. 

It is also possible to use yeast cells to produce the 
proteins. Protease-def icient cell lines such as BJ2168, 
BJ926, and CB023, and cell lines for secretion vectors, 
such as 20B-12, can be used as hosts (Supplement to Jikken 
Igaku (Experimental Medicine) "Bio Manual Series 4: Idenshi 
Donyu To Hatsugen Kaisekihou (Gene Introduction and 
Expression Analysis Methods)," Yohdosha, ppl66-176 (1994)). 
The expression vectors include pYEUra3 (Clontech) , pYEX™- 
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BX, and pYEX™-Sl. It is also possible to express the 
protein in fission yeast SP-Q01, using fission yeast 
expression vector pESP-1 (Stratagene) . The PGK promoter 
and the ADH1 promoter, which are constitutive; the CUPl 
promoter, which is inducible by copper ions; the Gall-GallO 
promoter, which is induced by galactose and repressed by 
glucose; and the PH05 promoter, which is induced by a 
reduction in phosphate concentrations and repressed by high 
phosphate concentrations are preferable as promoters that 
efficiently express the protein in the yeast cells. In 
fission yeast, promoters such as the nmtl promoter are 
preferable . 

Four broad categories of expression promoters can be 
used to produce recombinant proteins using E. coli cells. 
The XPL promoter is regulated by the clts857 repressor and 
is induced by heat shock. N4830-1 and M5219 can be used as 
the host, and vectors such as pPL-lambda, pKC30, and pRIT2T 
can be used for expression. The tac promoter is regulated 
by the lacl q repressor and is induced by adding isopropyl 
(3-D-thiogalactoside (IPTG) . JM105 and XLl-Blue can be used 
as the host, and vectors such as pDR540, pKK233-3, pGEX-3X, 
and pMAL-c2 can be used for expression. The trp promoter 
is regulated by the trp repressor and is induced by adding 
P indole acrylic acid (IAA) . HB101 and the like can be 
used as the host; vectors such as pBTrp2 can be used for 
expression. The T7 phage promoter is recognized for 
expression by only the T7RNA polymerase. Therefore, the 
BL21 (DE3) strain can be used as the host. This strain can 
be prepared by lysogenizing the E. coli BL21 strain with X 
phage DE3, into which the lad gene and a DNA fragment 
containing the T7RNA polymerase gene under the control of 
the lacUVS promoter are inserted within its int gene. The 
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inducible expression directed by the T7 promoter becomes 
possible by adding IPTG, which induces the T7RNA 
polymerase. The vectors include pET-3c and pET-8c. 
BL21 (DE3)pLysS contains, in addition to the above plasmid, 
a plasmid producing the T7 lysozyme. This lysozyme is a 
natural inhibitor that binds to the T7RNA polymerase and 
inhibits its transcription, in order to suppress the basal 
level T7RNA polymerase activity. Therefore, BL21 (DE3) pLysS 
can also be used as the host, pET-llc, pET-lld, and the 
like, which possess the T71ac promoter with the lac 
operator sequence inserted downstream of the T7 promoter 
transcription initiation site, can also be used as the 
expression vector (Studier, F. , et al., J. Mol. Biol. 
189:113-130 (1996), Studier, F. , et al., Methods Enzymol. 
185:60-8 (1990)). 

Methods of introducing the vector into the host 
include the electroporation method (Chu, G., et al., Nucl. 
Acids Res. 15:1311-1326 (1987)), the calcium phosphate 
method (Chen, C. and Okayama, H., Mol. Cell. Biol. 7:2745- 
2752 (1987)), the DEAE dextran method (Lopata, M. A., et 
al., Nucl. Acids Res. 12:5707-5717 (1984); Sussman, D. J. 
and Milman, G., Mol. Cell. Biol. 4:1642-1643 (1985)), and 
the lipofectin method (Derijard, B., Cell 7:1025-1037 
(1994); Lamb, B. T., et al., Nature Genetics 5:22-30 
(1993); Rabindran, S. K. , et al., Science 259:230-234 
(1993)), but any method can be used. 

The recombinant protein can be purified from the 
transformant thus obtained by means of the gel filtration 
method, ion exchange chromatography, affinity 
chromatography, reverse phase chromatography, 
hydroxyapatite chromatography, hydrogen bonding 
chromatography, and chelating columns (Deutscher, M. P., 
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eel., Methods Enzymol. 182, Guide to Protein Purification, 
1990; Principles and Methods Series: Gel Filtration, Ion 
Exchange chromatography, and Affinity chromatography. 
Pharmacia) . Antibodies against the protein of the present 
invention are prepared as described below, and the protein 
can be highly purified by means of affinity chromatography 
using the antibodies. 

Persons skilled in the art can, by using the prepared 
protein of the present invention, easily prepare the 
antibodies that bind to it. The antibodies of the present 
invention can be obtained by expressing the gene of the 
present invention using an appropriate E. coli expression 
vector; purifying the product; and immunizing rabbits, 
mice, rats, goats, or chickens with it. It is also 
possible to synthesize peptides that correspond to 
appropriate regions of the protein encoded by the gene of 
the present invention, and to immunize the animals 
described above, thereby obtaining the antibodies to the 
gene product. Methods to establish mouse or rat hybridomas 
can be used to produce monoclonal antibodies (Kohler and 
Milstein, Nature 256:495-497 (1975)). Specifically, mice, 
rats, or Armenian hamsters are first immunized with the 
prepared protein of the present invention. The antibody- 
producing cells are then collected from the spleen or the 
lymph nodes and fused in vitro with myeloma cells, and 
clones are selected through screening using the antigen 
(Harlow, E. and Lane, D., Antibodies: A Laboratory Manual. 
Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 
(1988)). The mouse myeloma cells include p3-x63-Ag8-Ul 
(P3-U1), P3-NSI/l-Ag4-l (NS-1) , and SP2/0-Agl4 (AP2/0) , and 
the rat myeloma cells include YB2/3HL. P2G11 . 16Ag20 (YB2/0) . 
The cells can be fused using polyethylene glycol or 
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electric pulses. Monoclonal antibodies, such as that 
contained in the cultured supernatant of the hybridomas and 
that contained in the ascites of the mouse treated with an 
immunosuppressant and with the mass-cultured hybridoma 

5 injected into its abdominal cavity, can be purified by, for 
example, protein A-Sepharose (Pharmacia) . Furthermore, 
monoclonal antibodies can also be purified using an 
affinity column having the protein of the present invention 
immobilized onto the support (Harlow, E. and Lane, D., 

10 Antibodies: A Laboratory Manual. Cold Spring Harbor 
Laboratory, Cold Spring Harbor, New York (1988)). 

When the antibodies thus obtained are administered to 
humans, it is beneficial to use a human or humanized 
antibody in order to reduce the immunogenicity . The 

15 methods to humanize antibodies include the CDR graft 
method, in which the antibody gene is cloned from the 
monoclonal antibody producing cells and the antigen 
determining region is transplanted to a known human 
antibody (Immunology Methods Manual 1: pp98-107, Academic 

20 Press) . Human antibodies can also be produced by 

immunizing a mouse that has its immune system replaced with 
the human immune system, following a procedure similar to 
the one used with regular monoclonal antibodies. The human 
B cell hybridoma method (Kozbor, et al., Immunology Today 

25 4:72 (1983)), and the Epstein-Barr virus (EBV) - Hybridoma 
method (Cole, et al. in Monoclonal Antibodies and Cancer 
Therapy, Ala R. Liss, Inc. pp77-96 (1985)) can also be used 
to produce monoclonal antibodies. 

The antibodies thus obtained can be used not only to 

30 detect the proteins of the present invention and as 
antibody therapies, but also to screen the proteins 
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described below that interact with the proteins of the 
present invention . 

The present invention also relates to the methods used 
to screen for the proteins that bind to the proteins of the 
present invention. The group of proteins having the PDZ 
domain, such as those of the present invention, share a 
common property of interacting with other proteins having 
the region of hydrophobic amino acids on the C-terminus. 
These and other binding proteins can be isolated by the 
screening methods of the present invention. These 
screening methods include the process to select the 
proteins that bind to the proteins of the present 
invention. In such a process, the sample proteins are 
brought into contact with the proteins of the present 
invention in the form of lysates from the cells or tissues 
that are expected to contain the target proteins. 

An example of the specific methods is the 
immunoprecipitation method. The immunoprecipitation method 
is the most common method used to detect protein-protein 
binding. In immunoprecipitation, biological samples, such 
as lysates from cells or tissues, for example, cell lysates 
prepared by dissolving cells such as human umbilical 
vascular endothelial cells with Triton X-100 or sodium 
deoxycholate, are usually brought into contact with the 
proteins of the present invention. The antibodies are then 
applied to the complex thus formed between the proteins of 
the present invention with their binding proteins. The 
immune complexes thus formed are then precipitated 
(Supplement to Jikken Igaku (Experimental Medicine) "Shin 
Idenshi Kogaku Handbook (New Genetic Engineering 
Handbook) ," Yohdosha, pp304-308 (1996)). 
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The immune complex can be precipitated by, for 
example, using protein A-Sepharose or protein G-Sepharose 
when the antibody is a mouse IgG antibody. General methods 
can be found, for example, in "Antibodies" (Harlow, E. and 
Lane, D., Antibodies. pp511-552, Cold Spring Harbor 
Laboratory Publications, New York (1988)). Moreover, 
methods based on those described above can generally be 
used even in the case of antibodies from other animal 
species . 

The proteins of the present invention, which are used 
in the immunoprecipitation, can have a recognition site 
(epitope) for the monoclonal antibody, whose specificity 
has been well characterized, that is introduced into the N- 
terminus or the C-terminus of the proteins. The proteins 
have thus been made into fusion proteins with the epitope, 
and the immune complexes can be formed by reacting the 
antibody to the epitope. 

A variety of epitope-antibody systems are commercially 
available, and these can also be used (Jikken Igaku 
(Experimental Medicine) 13:85-90 (1995)). Some 
commercially available vectors can express relatively large 
fusion proteins, such as those with p-galactosidase, 
maltose-binding protein, glutathione S-transf erase, and 
Green fluorescent protein, by incorporating the DNA 
encoding the desired protein through multi-cloning sites. 
In order to minimize the changes in the properties of the 
desired protein due to fusing, methods have been reported 
in which only a small epitope portion having several to a 
dozen or so amino acids is inserted. For example, the 
epitopes in poly-histidine (His-tag) , influenza 
hemagglutinin HA, human c-myc, FLAG, vesicular stomatitis 
virus glycoprotein (VSV-GP) , T7 gene 10 protein (T7-tag) , 
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human herpes simplex virus glycoprotein (HSV-tag) , E-tag 
(an epitope on a monoclonal phage) , etc., and their 
corresponding antibodies that recognize the epitopes can be 
used (Jikken Igaku 13: 85-90 (1995)), Any other epitope- 
antibody system can be used, as long as it can detect the 
fusion protein. It should be noted that the fusion 
proteins that bind to the proteins of the present invention 
can be isolated by means of affinity chromatography, 
without using antibodies. For example, the glutathione- 
Sepharose 4B column can be used for a GST-fusion protein. 

SDS-PAGE is generally used to analyze the 
immunoprecipitated proteins. In this method, gel of an 
appropriate concentration is used according to the 
molecular weights of the proteins so that the bound 
proteins can be analyzed. It is generally difficult to 
detect the bound proteins with ordinary staining methods 
for proteins (e.g., the Coomassie Brilliant Blue (CBB) 
staining method or the silver staining method) . However, 
the cells can be cultured in a medium to which 
35 S-methionine or 35 S-cysteine has been added to label the 
proteins, in order to increase the detection sensitivity. 
Once the molecular weight of a protein becomes known, it is 
possible to purify the protein directly from the 
SDS-polyacrylamide gel and to determine its sequence. In 
addition to the immunoprecipitation method described above, 
it is also possible to prepare the proteins by running the 
culture supernatant or the cellular extracts of the cells 
expected to express the proteins that bind to the proteins 
of the present invention through an affinity column having 
the proteins of the present invention immobilized onto it, 
then purifying the proteins that specifically bound to the 
column. 
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It is also possible to directly screen for the genes 
encoding the proteins that bind by using the proteins of 
the present invention. In this screening method, the gene 
products of the sample genes are brought into contact with 

5 the proteins of the present invention, thereby selecting 
the genes corresponding to the gene products that bind to 
the proteins of the present invention. There are no 
restrictions on the sample genes, but cDNA libraries 
prepared from the cells expected to express the proteins 

10 that bind to the proteins of the present invention are 

preferable. A specific example of the method utilizes the 
yeast 2 hybrid system (Fields, S. and Song, 0., Nature 
340:245-247 (1989)). Namely, one can express the proteins 
of the present invention within the yeast cells by fusing 

15 them with the SRF binding region, GAL 4 binding region, or 
LexA binding region. One can then introduce the cDNA 
libraries prepared from the cells expected to express the 
proteins that bind to the proteins of the present invention 
into the above yeast cells so that the proteins are 

20 expressed in a form fused with the VP16, GAL 4 transcription 
activation domain, or the E. coli B42 peptide. Finally, 
one can isolate the library-derived cDNA from the positive 
clones. (When a protein that binds to the protein of the 
present invention is expressed within the yeast cell, the 

25 binding between these proteins activates the reporter gene, 
enabling the detection of the positive clone.) 

The vectors and expression libraries to be used in 
this system can be purchased from several sources 
(Clontech, MATCHMAKER Two-Hybrid System; Stratagene, 

30 HybriZAP II Two-Hybrid System) . For the specific method, 
one can follow the manufacturer's manual. The genes 
encoding the proteins that bind to the proteins of the 
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present invention can be obtained directly by this method. 
In fact, the bindings between APC and hDLG (Matsumine, A., 
et al., Science 272:1020-1023 (1996)), between GRIP and the 
AM PA receptor (Dong, H., et al., Nature 386:279-284 

5 (1997)), between Homer and the glutamate receptor 

(Brakeman, P. R., et al., Nature 386:284-288 (1997)), and 
between SRY and SIP-1 (Poulat, F., et al. J. Biol. Chem. 
272:7167-7172 (1997)) were confirmed and the target 
proteins of the proteins having the PDZ domain were 

10 identified using this yeast 2 hybrid system. 

It is also possible to screen the proteins by the 
"west-western blotting method" (Skolnik, E. Y., Margolis, 
B., Mohammadi, M. , Lowenstein, E., Fischer, R., Drepps, A., 
Ullrich, A., and Schlessinger, J., Cloning of PI3 kinase- 

15 associated p85 utilizing a novel method for expression/ 

cloning of target proteins for receptor tyrosine kinases. 
Cell 65:83-90 (1991)). In this method, a cDNA library is 
prepared using a phage vector (such as Agtll and ZAP) from 
the cells expected to express the proteins that bind to the 

20 proteins of the present invention (e.g., human umbilical 

vascular endothelial cells) . The protein is then expressed 
on an LB-agarose, and the expressed proteins are fixed onto 
a filter with which the protein of the present invention 
that has been biotin-labeled or purified as a fusion 

25 protein with the GST protein is reacted. The plaques 
expressing the binding proteins are detected with 
streptavidin or an anti-GST antibody. It is then also 
possible to introduce the isolated genes from the above 
procedure into E. coli or other cells to express them and 

30 to prepare the proteins encoded by the genes. 

It should be possible to determine the signal 
transduction pathways mediated by the protein-protein 



-24- 



Attorney Docket: 06501/056001 



interaction by using the proteins of the present invention 
to isolate and analyze their binding proteins and the genes 
encoding them. Furthermore, as the relationship between 
the signal transduction and diseases becomes clearer, it 

5 will be possible to develop pharmaceuticals targeted at the 
proteins of the present invention and the proteins that 
interact with them. 

It is also expected that treatments using antisense 
DNA against the DNA encoding these proteins will become 

10 possible. In the present invention, "antisense DNA" refers 
to the DNA encoding the RNA that is complementary to the 
transcription product of the target gene, thereby employing 
the activity to suppress the expression of the target gene. 
Antisense DNA does not have to be perfectly complementary 

15 to the transcription product of the target gene, as long as 
it can effectively block the expression of the target gene. 
It preferably possesses 90% or more, and more preferably 
95% or more, complementarity. The chain length of the 
antisense DNA is 15 nucleotides or more, preferably 

20 100 nucleotides or more, and more preferably 

500 nucleotides or more. Various modified antisense 
oligonucleotides are being utilized as antisense DNA. For 
example, phosphorothioates (S-oligos) are preferable in 
terms of stability and solubility. The methods for 

25 introducing antisense DNA include direct administration, 
lipofection, the HVJ method, and the HVJ-liposome method. 
It is also possible to perform the treatment with antisense 
RNA using vectors. In this case, the gene therapy is 
achieved by inserting the DNA of the present invention 

30 backwards into the vector used in the recombinant protein 
production in animal cells described above. The DNA is 
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then expressed within the body by introducing it through 
direct administration, lipofection, the HVJ method, the 
HVJ-liposome method, etc. It is also possible to employ 
the methods of gene introduction using virus vectors such 
5 as adeno-associated virus, Adenovirus, human herpes simplex 
virus, vaccinia virus, and Fowlpox virus, in order to 
express the antisense RNA within the body. Treatments 
using ribozymes, instead of antisense DNA, are also 
possible . 

10 

Brief Description of the Drawings 
Figure 1. Sequence comparisons between "32-8-1" (top) 
and "AF00168" (bottom) are shown. 

Figure 2. Sequence comparisons between "32-8-1" (top) 
15 and "AJ001319" (bottom) are shown. 

Figure 3 Sequence comparisons between "32-8-1" (top) 
and "AJ001320" (bottom) are shown. 

Figure 4. Continuation of Figure 3, sequence 
comparisons between "32-8-1" (top) and "AJ001320" (bottom) 
20 are shown. 

Figure 5. Photograph of the electrophoresis showing 
the results of northern blot analysis of the "32-8-1 gene" 
is shown. The BamHI-Xbal fragment was used as the probe. 
"H" in the figure indicates the results with the Human 
25 Multiple Tissue Northern (MTN) Blot (Clontech #7760-1) . 
The lanes are 1. Heart, 2. Brain, 3. Placenta, 4. Lung, 
5. Liver, 6. Skeletal muscle, 7. Kidney, and 8. Pancreas. 
"H4" indicates the results with the Human Multiple Tissue 
Northern (MTN) Blot IV (Clontech #7766-1), and the lanes 
30 are 1. Spleen, 2. Thymus, 3. Prostate, 4. Testis, 

5. Uterus, 6. Small intestine, 7. Colon, and 8. Peripheral 
Blood Leukocyte. "F2" indicates the results with the Human 
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Fetal Multiple Tissue Northern (MTN) Blot II (Clontech 
#7756-1), and the lanes are 1. Fetal brain, 2. Fetal lung, 
3. Fetal liver, and 4. Fetal kidney. 

Figure 6. Photograph of the electrophoresis showing 

5 the results of northern blot analysis of the "32-8-1 gene'' 
is shown. The Ndel 1.2 kb-#l probe was used. "H" in the 
figure indicates the results with the Human Multiple Tissue 
Northern (MTN) Blot (Clontech #7760-1), and the lanes are 
1. Heart, 2. Brain, 3. Placenta, 4. Lung, 5. Liver, 

10 6. Skeletal muscle, 7. Kidney, and 8. Pancreas. "H4" 
indicates the results with the Human Multiple Tissue 
Northern (MTN) Blot IV (Clontech #7766-1), and the lanes 
are 1. Spleen, 2. Thymus, 3. Prostate, 4. Testis, 

5. Uterus, 6. Small intestine, 7. Colon, and 8. Peripheral 
15 Blood Leukocyte. "F2" indicates the results with the Human 

Fetal Multiple Tissue Northern (MTN) Blot II (Clontech 
#7756-1), and the lanes are 1. Fetal brain, 2. Fetal lung, 
3. Fetal liver, and 4. Fetal kidney. "Mu" indicates the 
results with the Human Muscle Multiple Tissue Northern 

20 (MTN) Blot (Clontech #7765-1), and the lanes are 
1. Skeletal muscle, 2. Uterus, 3. Colon, 4. Small 
intestine, 5. Bladder, 6. Heart, 7. Stomach, and 
8. Prostate. W C" indicates the results with the Human 
Cancer Cell Line Multiple Tissue Northern (MTN) Blot 

25 (Clontech #7757-1), and the lanes are 1. Promyelocytic 
leukemia HL-60 cells, 2. HeLa S3 cells, 3. Chronic 
myelogenous leukemia K-562 cells, 4. Lymphoblastic leukemia 
MOLT-4 cells, 5. Burkitt's lymphoma Raji cells, 

6. Colorectal adenocarcinoma SW480 cells, 7. Lung carcinoma 
30 A549 cells, and 8. Melanoma G361 cells. 

Figure 7. Positional relationships among various 
clones isolated by the present inventors are presented. 
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These are the "32-8-1" cDNA clone; the heart cDNA derived 
"686-1-2" and "686-1-4" clones. The fetal liver cDNA 
derived "FL #5," "#12," and "#6" clones are also shown. 
The PDZ domains encoded by the 32-8-1 gene are indicated by 

5 circles. The translation initiation site at nucleotide 292 
and the translation termination site at nucleotide 4410 are 
also indicated in the figure. The positions of the probes, 
Ndel 1.2 kb-#l and BamHI-Xbal, are also shown. 

Figure 8. PDZ domain sequences of the protein (SEQ ID 

10 NO: 1) encoded by the 32-8-1 gene are shown. The PDZ 

domain sequences that exist within the protein encoded by 
the 32-8-1 gene are aligned. 

Figure 9. Four colonies of E. coli transf ormants 
expressing GST-PDZ56 were picked, and the expression was 

15 compared depending on the presence or absence of the 

isopropyl thiogalactoside (IPTG) induction. Transf ormants 
with pGST-2TK were used as a control. The samples from 
each clone were analyzed on a 10% to 20% SDS-polyacrylamide 
gel, with even-numbered lanes before the IPTG induction and 

20 odd-numbered lanes three hours after the IPTG induction. 

Lanes 2 and 3 correspond to the pGST-2TK transf ormants, and 
lanes 4 through 11 correspond to clone 1 through 4 of the 
E. coli transf ormants expressing GST-PDZ56 . Lane 1 shows 
molecular weight markers. The bands corresponding to the 

25 induced expression of GST-PDZ56 are indicated with an 
arrow. 

Figure 10, The same samples used in the experiment 
shown in Figure 9 were analyzed by western blot. Bands 
(indicated with an arrow) corresponding to the induced 
30 expression of the 55 kDa protein were detected with the 
anti-GST antibody. The bands near 30 kDa seen in the 
samples three hours after the IPTG induction are 
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interpreted to represent the degradation of the GST-PDZ56 
protein. 

Figure 11. The expression of GST-PDZ14 from the 
E. coli transformants three hours after the IPTG induction 
was analyzed by Coomassie blue staining. Lanes 2 and 6 
correspond to the samples prior to the IPTG induction; 
lanes 3 through 6 correspond to clones 1, 2, 3, and 4 of 
the E. coli HB101 transformants; and lanes 8 through 11 
correspond to clones 1, 2, 3, and 4 of the E. coli JM109 
transformants, showing the results of GST-PDZ14 expression 
after the IPTG induction (arrow) . Lane 1 shows molecular 
weight markers. 

Figure 12. The purification process of PDZ56 is shown. 
Coomassie blue staining was used. Lane 1 shows molecular 
weight markers. Lane 2 corresponds to the culture media; 
lane 3, to the sonicated sample; lane 4, to the fraction 
unbound to the glutathione-Sepharose column; lanes 5 
through 7, to the washes; and lanes 8 and 9, to the PDZ56 
protein not containing the GST protein portion, which has 
come off the glutathione-Sepharose column after digestion 
by thrombin. Bands at approximately 30 kDa can be clearly 
seen (arrow indicates PDZ56) . Lane 10 corresponds to the 
GST protein portion bound to the glutathione-Sepharose 
column, which was eluted after digestion by thrombin (arrow 
indicates GST) . Lanes 11 and 12 show the GST-PDZ56 fusion 
protein that was eluted without thrombin digestion in a 
regular elution buffer containing glutathione (arrow 
indicates GST-PDZ56) . 

Figure 13. The results of western blotting performed 
with anti-GST antibody using a filter onto which the same 
samples used in the experiment shown in Figure 12 were 
blotted are shown. Comparing lanes 8 and 9 with lane 10 
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(arrow: GST) clearly shows that the 55kDa GST-PDZ56 fusion 
protein (arrow indicates GST-PDZ56) shown in lanes 11 and 
12 has been cleaved by thrombin to yield only PDZ56 that 
does not contain the GST portion. Bands in lanes 8 and 9 

5 of Figure 12 cannot be detected by the GST antibody used in 
Figure 13 because they do not contain GST. 

Figure 14. The purification process of PDZ14, similar 
to that in Figure 9, is shown. Lane 1 shows molecular 
weight markers. Lane 2 corresponds to the culture media; 

10 lane 3, to the sonicated sample; lane 4, to the fraction 
unbound from the glutathione-Sepharose column; lanes 5 
through 8, to the washes; and lanes 9, 10, and 11, to the 
PDZ14 protein not containing the GST protein portion, which 
came off the glutathione-Sepharose column after digestion 

15 by thrombin. Bands at 65 kDa can be clearly seen (arrow 
indicates PDZ14) . However, degradation products of the 
PDZ14 protein were also detected at 28 kDa and 37 kDa 
(arrows indicate 37 kDa and 28 kDa) . 

Figure 15. Out of the Protein Medley (Clontech) , the 

20 filters blotted with lOOmg each of the cell lysates from 
human testis (T) , skeletal muscle (Sk) , liver (Lv) , 
heart (H) , and brain (B) were reacted with the antisera 
from the rabbits immunized with peptide 32-8-1-17, PDZ14, 
or PDZ56 for western blotting. The filters were reacted 

25 sequentially with the 5,000-fold diluted rabbit antiserum, 
the 1,000-fold diluted biotin-labeled anti-rabbit Ig 
antibody, and the 2,500-fold diluted horseradish peroxidase 
(HRP) -labeled streptavidin-biotin complex (Amersham) . The 
results of detection by chemiluminescence of the proteins 

30 that react with the rabbit antisera are shown. In the 

liver tissue, the present inventors were able to detect a 
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band at around 130 kDa. This band is expected to have been 
derived from the 32-8-1 protein (arrow) . 

Figure 16, The results of an analysis of the tissue 
specificity of the 32-8-1 gene expression by RT-PCR are 

5 shown. The 24-types of first strand cDNAs used were 
1. brain, 2. heart, 3. kidney, 4. liver, 5, lung, 
6. pancreas, 7. placenta, 8. skeletal muscle, 9. colon, 
10, ovary, 11. peripheral leukocyte, 12. prostate, 
13, small intestine, 14. spleen, 15. testis, 16. thymus, 

10 17. fetal brain, 18. fetal heart, 19. fetal kidney, 

20. fetal liver, 21. fetal lung, 22. fetal skeletal muscle, 
23. fetal spleen, and 24. fetal thymus. Single bands at 
650 bp were detected in panel A, and three bands (750 bp, 
850 bp, and 950 bp) were detected in panel B. 

15 Figure 17. The comparisons among the sequences of 

FH750, FH850, and FH950 are shown. 

Figure 18. The continuation of Figure 17 showing the 
comparisons among the sequences of FH750, FH850, and FH950 
is shown. 

20 Figure 19. A photograph of an electrophoresis 

presenting the results of detection of the 32-8-lb protein 
by western blotting is shown. Lanes 1 and 2 were detected 
with the antisera against the 32-8-1-17 peptide, and 
lanes 3 and 4 were detected with the antisera against 

25 PDZ56. Cell lysates from neuroblastoma cells SH-SY5Y 

(lanes 1 and 3) and the NT-N cells (lanes 2 and 4), which 
are neurons differentiated from NT-2 by the retinoic acid 
stimulation, were separated on an SDS-polyacrylamide gel. 
Bands that are expected to correspond to the 32-8-lb 

30 protein were detected with a size of 250 kDa or more. 

Figure 20. Sequence comparisons between "32-8-lb" 
(top) and "AF00168" (bottom) are shown. 
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Figure 21. Sequence comparisons between "32-8-lb" 
(top) and "AJ001319" (bottom) are shown. 

Figure 22. Sequence comparisons between "32-8-lb" 
(top) and "AJ001320" (bottom) are shown. 

Figure 23. The continuation of Figure 22, which 
presents sequence comparisons between "32-8-lb" and 
"AJ001320," is shown. 

Figure 24. The continuation of Figure 23, which 
presents sequence comparisons between "32-8-lb" and 
"AJ001320," is shown. 

Figure 25. The sequences of the PDZ domains in the 
protein (SEQ ID NO: 83) encoded by the 32-8-lb gene are 
shown. The sequences of the PDZ domains that exist within 
the protein encoded by the 32-8-lb gene are aligned. 

Figure 26. The positional relationships among the 
various clones isolated by the present inventors are shown. 
These are the "32-8-1" cDNA clone; the heart cDNA derived 
"686-1-2" clone, "686-1-4" clone, and "FH950" clone; the 
fetal liver cDNA derived "FL#5," "#12," and "#6" clones; 
and the brain derived "1.2 kb #33" clone and "D-2" clone 
are shown. The PDZ domains encoded by the "32-8-lb" gene 
are indicated by rectangles. 

Best Mode for Implementing the Invention 
Embodiments of the present invention are exemplified 
below. However, the present invention shall in no way be 
limited by these examples. 

Example 1 Cloning of genes 
(1) Differential display 

The human umbilical vascular endothelial cells (HUVEC) 
were obtained from Morinaga Biochemistry Research Institute 
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and cultured by using the Normal Human Vascular Endothelial 
Cell Culturing kit (Catalog #680051) . When the cells became 
subconf luent, 10 ng/ml Recombinant Human Tumor Necrosis 
Factor-a (TNFa, Catalog #300-01A, PEPROTECH Inc.) was added, 

5 and the cells were cultured for another 24 hours. The 
expressed genes were compared with those from the cells 
without the addition of TNFa. Cells were detached from the 
plate with trypsin-EDTA, precipitated by centrif ugation at 
1,000 rpm for 5 minutes, and washed once with PBS. The 

10 total RNA was then recovered by using an RNAeasy Total RNA 
kit (QIAGEN) . Using 0.2 \iq of the recovered total RNA, the 
present inventors synthesized cDNA by means of the H-T11G 
anchor primer. The conditions were based on those given in 
the manual for the RNAimage kit (GenHunter) . Genes were 

15 randomly amplified using the TAKARA Taq polymerase through 
40 cycles of polymerase chain reaction (PCR) . Each cycle 
consisted of 94°C for 30 seconds, 40°C for 2 minutes, and 
72 °C for 30 seconds, for each of the eight kinds of 
arbitrary primers H-AP1 through H-AP8 . The reaction mixture 

20 contained a- 32 P dATP. The products were separated on 
sequencing gels, and those genes whose bands were 
intensified by the TNFa stimulation, that is, the genes 
whose mRNA expression was increased as compared to the case 
with no stimulation, were amplified again with the same 

25 conditions. The primer DNA was then removed from the 

reaction mixture using a Qiaquick Spin PCR Purification kit. 
The nucleotide sequence information of "DDEST32" shown in 
SEQ ID NO: 13 was obtained by analyzing the products with a 
Dye Terminator Cycle Sequencing FS Ready Reaction kit 

30 (Perkin Elmer, Catalog #402122) using the same primers used 
for amplification. 
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(2) Construction of cDNA library 

A cDNA library was constructed using a ZAP-cDNA 
synthesis kit (Stratagene) . A 10 x 1 st strand buffer 
(5 pi), 3 yl of 1 st strand methyl nucleotides mix, 2 pi of 

5 linker-primer (1,4 pg/pl) , 1 ml of RNase Block ribonuclease 
inhibitor (40 U/pl) , 10 pi of TNFa-stimulated HUVEC poly A + 
mRNA (0.5 pg/pl), and 24 pi of diethyl pyrocarbonate 
(DEPC) -treated water were gently mixed and allowed to stand 
at room temperature for 10 minutes. Superscript II reverse 

10 transcriptase (5 pi, 200 U/pl, GIBCO-BRL) was mixed with 
the cDNA library. The mixture was incubated at 37 °C for 
40 minutes then at 45 °C for 70 minutes. The reaction 
mixture was put on ice, and 20 pi of 10 x 2 nd strand buffer, 
6 pi of 2 nd strand nucleotide mix, 115.9 pi of sterilized 

15 distilled water, RNase H (1.5 U/pl), and 11.1 pi of DNA 

polymerase I (9 U/pl) were mixed into 45 p of the reacted 
mixture by vortexing, and the mixture was incubated at 16°C 
for 150 minutes. After the reaction, 23 pi of blunting 
dNTP mix and 2 pi of cloned Pfu DNA polymerase (2.5 U/pl) 

20 were added, and the mixture was incubated at 72 °C for 

30 minutes. The mixture was then sequentially extracted 
with 200 pi of phenol/chloroform, and with chloroform, and 
further precipitated by adding 20 pi of 3M sodium acetate 
and 400 pi of 100% ethanol. After overnight incubation at 

25 -20°C, the mixture was centrifuged at 15,000 rpm for 
60 minutes (4°C), and the precipitate was washed with 
500 pi of 70% ethanol and dried. The precipitate was 
dissolved in 9 pi of 0 . 4 pg/pl EcoRI adapter and incubated 
at 4°C for 45 minutes. 10 x ligase buffer (1 pi), 1 pi of 

30 10 mM ATP, and 1 pi of T4 DNA ligase (4 U/pl) were then 

added to the above, and the ligation reaction was performed 
overnight at 8°C. The mixture was incubated at 70 °C for 



-34- 



Attorney Docket: 06501/056001 



30 minutes to inactivate the enzyme, spun down to collect 
the solution on the bottom of the tube, and let sit for 
5 minutes at room temperature. To the mixture were added 
1 pi of 10 x ligase buffer, 2 pi of ATP, 6 pi of sterilized 

5 water, and 1 pi of T4 polynucleotide kinase (10 U/pl) . The 
mixture was incubated at 37 °C for 30 minutes then incubated 
again at 70 °C for 30 minutes to inactivate the enzyme. 
Xhol buffer supplement (28 pi) and 3 pi of Xhol (40 U/pl) 
were added to the above, and the mixture was reacted at 

10 37 °C for 90 minutes. The mixture was cooled to room 

temperature, then 5 pi of 10 x STE buffer was added. The 
mixture was then applied to a Sephacryl S-500 column and 
eluted twice with 60 pi of 1 x STE buffer. Ethanol 
(120 ml) was then added to the mixture, and the mixture was 

15 allowed to stand at -20°C overnight. It was then 

centrifuged at 15,000 rpm for 60 minutes (4°C) to obtain 
the precipitate. The precipitate was washed with 200 pi of 
80% ethanol and dried. It was then dissolved with 6 pi of 
sterilized water, and 2.5 pi of it was used for the 

20 ligation reaction with the vector. To 2.5 pi of the cDNA, 
1 pi of Uni-ZAP XR vector (1 pg) , 0.5 pi of 10 x ligase 
buffer, 0.5 pi of 10 mM ATP, and 0.5 pi of T4 DNA ligase 
(4 U/pl) were added and reacted at 12°C overnight. The 
ligation mixture (1 pi) was added to the Gigapacklll Gold 

25 Packaging extract, mixed well, and incubated for two hours 
at room temperature. SM buffer (500 pi; 5.8 g NaCl, 2.0 g 
MgS0 4 -7H 2 0, 50 ml 1 M Tris-HCl (pH 7.5), and 5 ml 2% (w/v) 
gelatin, brought up to 1 L with deionized water) was added 
to the above, and after 20 pi of chloroform was added, it 

30 was gently mixed. The mixture was then centrifuged, and 

the supernatant was transferred to another tube and stored 
at 4°C. The phage titer was measured using 0.1 pi and 1 pi 
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of the packaging reaction. Since approximately 300 plaques 
were obtained from 0.1 jil, the titer was estimated to be 
3,000 PFU (plaque-forming units) per microliter. XL1 Blue 
MRF' was used as the host E. coli. It was cultured in 

5 20 ml LB/ 10 mM MgS0 4 / 0.2% maltose at 37°C for 6 hours, 

placed on ice for 5 minutes before the OD 6 oo became 1.0, and 
centrifuged at 500 x g for 10 minutes. To resuspend the 
precipitated cells, 10 ml of 10 mM MgS0 4 was added, and 
cells were diluted with 10 mM MgS0 4 so that the OD 60 o became 

10 0.5. The packaging reaction (17 ]il) was added to 600 ]il of 
the freshly prepared XL-1 Blue MRF', and incubated at 37 °C 
for 15 minutes. NZY top agar (6.5 ml; made by adding 0.7% 
(w/v) agarose to the NZY medium and autoclaving it) , which 
had been incubated at 45 °C, was added to the above and 

15 plated onto NZY agar plates. The plates were prepared as 
follows. NaCl (5 g) , MgS0 4 -7H 2 0 (2.0 g) , yeast extracts 
(5 g) , NZ amines (10 g) , and agar (15 g) were dissolved in 
deionized water to make the total volume 1 L. The solution 
pH was adjusted with NaOH to 7.5, after which the solution 

20 was autoclaved and poured into sterilized culture plates. 
After culturing at 37°C for six hours, the plaques were 
transferred onto a Hybond N + filter (Amersham, RPN203B) by 
placing the filter on the plate, denatured with 1.5 M NaCl- 
0.5 M NaOH for 7 minutes, neutralized by treating with 1.5 

25 M NaCl-0.5 M Tris-HCl (pH 7.2)/ 1 mM EDTA for 5 minutes, 
and finally rinsed with 2 x SSC. After the filter was 
dried, the plaques were fixed onto the filter by 
StrataLinker (Stratagene) . 



30 (3) Screening of the cDNA library 

The "DDEST32" DNA fragment was isolated on a 2% 
agarose gel, and recovered from an agarose gel slice with a 
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QIAEX II gel extraction kit (QIAGEN) . The fragment was 
labeled by random labeling so it could be used as the 
probe. Using a Megaprime kit (Amersham, RPN1607), 5 yl of 
primer solution was added to 25 ng of the probe DNA, and 

5 incubated at 95°C for 5 minutes. After the solution was 
incubated at room temperature, 10 ]il of labeling buffer, 
18 til of water, 5 ]il of a 32 P dCTP (-3000ci/mmol; Amersham) , 
and 2 ]il of Klenow fragment were added to it, and the mixed 
solution was incubated at 37 °C for 30 minutes. The 

10 reaction was stopped by adding 2 of 0.5 M EDTA, and the 
free a- 32 P dCTP was removed with a Pharmacia ProbeQuant G-50 
column. After prehybridization at 60 °C in the Rapid hybri 
buffer (Amersham, RPN1636) , the labeled probe was heat- 
denatured at 95 °C, rapidly chilled on ice, and added to the 

15 hybridization buffer. Hybridization was then performed by 
shaking at 60 °C for two hours. The probe was used at a 
concentration of 2 x 10 6 cpm/ml. The filter was washed 
three times in 2 x SSC/ 0.05% SDS at room temperature for 
ten minutes each, and twice more in 0.1 x SSC/ 0.1% SDS at 

20 60°C for 20 minutes each. The phage collected from 

positive plaques was diluted in SM buffer and plated onto 
10-cm plates so that approximately 100 plaques were formed 
per plate. The secondary and tertiary screenings were 
performed similarly. As a result, clone "#32-8-1" was 

25 obtained as the positive clone. The gene that had been 
cloned in the Uni-ZAP vector was recovered as ordinary 
plasmid DNA by the in vivo excision method. 



Example 2 Sequence determination of the "32-8-1" gene 
30 (1) Preparation of the cDNA library for RACE 

The cDNA for RACE was synthesized using a Marathon 
cDNA amplification kit (Clontech) . The total RNA (1 ]iq) 
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obtained from the TNFa-stimulated HUVEC cells was used for 
the experiment. Oligo dT primer (1 pi, 10 pM) was added to 
the above, the total volume was brought to 5 pi, and the 
mixture was incubated at 7 0°C for 2 minutes and then placed 

5 on ice for 2 minutes. Two microliters of 5 x 1 st strand 
buffer, 1 pi of 10 mM dNTP mix, and 1 pi of 100 U/pl MMLV 
reverse transcriptase were added to the above, and the 
total volume was made 10 pi. The mixture was then 
incubated at 42 °C for 1 hour to synthesize the first strand 

10 cDNA. Sixteen microliters of 5 x 2 nd strand buffer, 1.6 pi 
of 10 mM dNTP mix, and 4 pi of 20 x 2 nd strand enzyme 
cocktail were next added to the mixture, the total volume 
was adjusted to 80 pi with water, and the mixture was 
incubated at 16°C for 90 minutes. T4 DNA polymerase (2 pi, 

15 5 U/pl) was then added, and the reaction was performed at 

16°C for 45 minutes. After 4 jil of 20 x EDTA/ glycogen was 
added to the mixture, it was deproteinized with equal 
volumes of phenol/chloroform, and isoamyl 
alcohol /chloroform. 

20 Ethanol precipitation was done with 35 pi of 4 M 

ammonium acetate and 263 pi of 95% ethanol, and the 
precipitate was washed with 80% ethanol and spontaneously 
dried for 10 minutes. The dried precipitate was dissolved 
in 10 pi of deionized water, and 7.5 pi was used for the 

25 adapter ligation reaction. Marathon cDNA adapter (3 pi, 
10 pM) , 3 pi of 5 x DNA ligation buffer, and 1.5 pi of T4 
DNA ligase (1 U/pl) were added to the above and reacted 
overnight at 16 °C. The enzyme was inactivated by 
incubation at 7 0°C for 5 minutes, and the total volume was 

30 adjusted to 150 pi by using 135 pi of Tricine-EDTA buffer 
contained in the kit. 
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(2) cDNA cloning by RACE and sequence determination 

Clone #32-8-1 was subcloned by utilizing the 
restriction enzyme recognition sites within the gene (PstI, 
Xbal, BamHI, and EcoRI), and the nucleotide sequence was 
determined by the cycle sequence method using a Dye 
Terminator Cycle Sequencing FS Ready Reaction kit (Per kin 
Elmer, Catalog #402122). 

The sequences of the primers used are shown in Table 
1 . n C" indicates a primer for the complementary DNA 
strand. 

Table 1 



Primer # DNA sequence SEQ ID NO: 



i nfi 




c 
\ — 


CTCCCCATCCCTCGTCCACC 


14 


A Hi 






J. J. X uiiU X \jr\\^ X \J\J 


15 








r\ 1 \3r\\J ill uU X X riV--rl\3\^ X uu 




ADO 








17 










X. o 


801 






ACT GT TACT ACT T C T GAT GC 


19 


1192- 


-1161 




TCTGATGGTCCCACAGTCTG 


20 


1282 




c 


GTTGTTTCGCAGCCAGGGAT 


21 


1524 






CTGAGCATCGTTGGGGGTTC 


22 


1449 




c 


CCTCATCTCTGTAGAGTGTC 


23 


1683 






T GT TAGCCCCCTCACTAAGG 


24 


1803 






GCTATGTGCTAGGAAATACG 


25 


2116 






TAGGGAGAAGGATCAGAGCG 


26 


607- 


93 




ACAGATTTCTGACTCACTGG 


27 


128 






TGGAAATAGGCATTCTTCAG 


28 


607- 


462 




ATACAAAGACGGTCTAATCC 


29 


2920 




c 


CCGCTTTCCCATCTTTAGAAAC 


30 


3121 






TATCTCGTGTGGAAGATGTG 


31 


2266 


-107 


c 


ACATAAATGTTGCTATCACC 


32 


3361 






TGCCACTTAGTAGCCGAGTG 


33 


3615 






GCATTGCATTACAGTTGAGC 


34 


1301 




c 


TCCTCCTTTGACAATGTCTG 


35 


BXR 




c 


CATTTCGACTGTTCTTAATC 


36 


XB 




c 


TCAGTGGATGTGCCACAGAT 


37 


4221 




c 


CAGTAGGTTAACTGCTTCGG 


38 


BX 






AGTTCCAGTCTTTCTTTCGG 


39 


4335 






TTTCTTTCACTGGGCTGAAGTC 


40 


XBR 






CCTCTGAAGACGGACGTCTG 


41 
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Accordingly, a nucleotide sequence of 5,146 bp was 
determined. When the first G of the EcoRI recognition site 
was counted as nucleotide 1, the PDZ domain started at 
nucleotide 4 68. A stop codon immediately followed three 

5 repeated stretches of approximately 8 0 amino acids. The 
sequence in the 3' region of the gene also contained three 
PDZ domains at a distance of approximately 2 kb from the 
stop codon described above. (An experiment conducted later 
revealed that clone #32-8-1 contains a sequence of 

10 approximately 2 kb. This sequence has been derived from an 
intron, transcribed and inserted, thereby introducing the 
stop codon immediately after the first three PDZ domains.) 

Therefore, the present inventors performed 5' Rapid 
Amplification of cDNA Ends (RACE) starting from the 

15 position of the three PDZ domains found in the latter half. 
Using 5 ]il of the cDNA described above, 5' RACE was 
performed according to the manual contained in the kit. 
The reaction mixture consisted of 5 jil of cDNA, 5 ]il of 
10 x Advantage™ KlenTaq buffer (which came with the kit) , 

20 4 ]il of 2.5 mM dNTP, 1 pi of 10 ]iM API primer 

(CCATCCTAATACGACTCACTATAGGGC (SEQ ID NO: 42)), 1 jil of 
10 \xM 32-8-1 5' RACE primer #22 (TTGGGGTGGGGAGAGGAGGTAGATTGC 
(SEQ ID NO: 43)), 1 yl of Advantage™ KlenTaq polymerase mix 
(Toyobo, CLK8417-1), and 33 ]xl of deionized water to make 

25 the total 50 PCR reaction was performed using a Perkin 

Elmer Thermal Cycler 2400. A reaction consisting of 94 °C 
for 1 minute, five cycles of 94 °C for 5 seconds and 72 °C 
for 2 minutes, five cycles of 94°C for 5 seconds and 70°C 
for 2 minutes, followed by 25 cycles of 94 °C for 5 seconds 

30 and 68 °C for 2 minutes did not produce clearly detectable 

bands. By performing nested PCR under the same conditions, 
the present inventors were able to obtain a band of 
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approximately 1.8 kb. Here the AP2 primer 
(ACTCACTATAGGGCTCGAGCGGC (SEQ ID NO: 44)) and 32-8-1 5' 
RACE primer #1034 (GCACATCACCAAGTGGGCTGCCTACTC (SEQ ID 
NO: 45) ) were used as primers, and 5 pi of the 50-fold 
5 dilution of the initial PCR product was used. Also, the 
original 25 cycles of 94 °C for 5 seconds and 68 °C for 
2 minutes was reduced to 15 cycles. As a result, cDNA 
clone "32-8-1/5R3, " which does not contain the 2 kb gap, 
was obtained. 

10 Next, the present inventors determined the sequence of 

clone 32-8-1/5R3. The sequences of the primers used for 
the sequence determination of 32-8-1/5R3 are shown in 
Table 2. W C" indicates a primer for the complementary DNA 
strand. 



15 



Table 2 



Primer # 



DNA sequence 



SEQ ID NO: 



EX 

456 

XER 

678 

801 

1192-1161 

1282 

1524 

1449 

2116 

1301 

839 

1389 



C 



C 



C 



C 



C 



ATGAGTTTGGTTACAGCTGG 
AATCTAATGCAGCTCGCCTG 
AGTCTTGCTGGGAACAAAGA 
TCACTTTAGAAGGGGCACAT 
ACTGTTACTACTTCTGATGC 
TCTGATGGTCCCACAGTCTG 
GTTGTTTCGCAGCCAGGGAT 
CTGAGCATCGTTGGGGGTTC 
CCTCATCTCTGTAGAGTGTC 
TAGGGAGAAGGATCAGAGCG 
TCCTCCTTTGACAATGTCTG 
TTTCATCATCTACAGCCAGT 
TGACACCCTCACTATTGAGC 



46 
47 
48 
49 
50 
51 
52 
53 
54 
55 
56 
57 
58 



20 



The nucleotide sequence of 2,819 bp, which was 
determined by combining the sequences of clones #32-8-1 and 
32-8-1/5R3, is shown in SEQ ID NO: 59. 
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Example 3 Cloning of a cDNA clone corresponding to the 5' 
upstream region of the 32-8-1/5R3 cDNA clone by RACE 

The present inventors attempted to isolate the 
upstream cDNA located 5' to the 32-8-1/5R3 clone by the 5' 

5 Rapid Amplification of cDNA Ends (RACE) method. A human 
heart cDNA library and a human fetal liver cDNA library 
were used as cDNA sources. Two clones, 2.8 kb and 1.2 kb 
in size, were obtained from the human heart cDNA library. 
One 1.1 kb clone was obtained from the human fetal liver 

10 cDNA library. The cloning procedure is described below. 

The present inventors used cDNA Library Human Heart 
(Takara Shuzo, Catalog #9604) for the human heart cDNA 
library. The XL1 Blue-MRF' E. coli cells transformed with 
the plasmid DNA containing the cDNA inserted into the 

15 pAP3neo vector (Genbank Accession No.AB003468) were 

cultured by the usual method, the plasmid DNA was recovered 
by the alkaline method, and the cDNA clone containing the 
5' upstream region was obtained by PCR using 10 ng of the 
plasmid DNA as the template. The reaction mixture 

20 consisted of 10 ng of the cDNA, 5 ]il of 10 x Advantage™ 
KlenTaq buffer (which came with the kit), 4 pi of 2.5 mM 
dNTP, 1 \il of 10 ]iVL AP3neo5' primer (which came with the 
kit; 5' -GCCCTTAGGACGCGTAATACGACTC-3' (SEQ ID NO: 60)), 1 \il 
of 10 iaM 32-8-1 5' RACE primer #686 

25 (5'-AGCCAGTATCTGATCTCCGACTTTG-3' (SEQ ID NO: 61)), and 1 jil 
of Advantage™ KlenTaq polymerase mix (Toyobo, CLK8417-1) . 
These were mixed with deionized water to make the total 
50 PCR reaction was performed using a Perkin Elmer 

Thermal Cycler 2400. A reaction consisting of 94 °C for 

30 1 minute, 5 cycles of 94 °C for 5 seconds and 72 °C for 

4 minutes, and 5 cycles of 94 °C for 5 seconds and 70°C for 
4 minutes, followed by 25 cycles of 94 °C for 5 seconds and 
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68°C for 4 minutes, yielded the bands of 2.8 kb and 1.2 kb. 
The products were separated on a 0.8% agarose gel. The 
corresponding bands were excised and purified with the 
QIAquick gel extraction kit (QIAGEN, 28706) and were 

5 subjected to TA cloning according to the manual for the 
pGEM-T Vector System I (Promega, A3600) . The present 
inventors designated the two clones as 68 6-1-4 (2.8 kb) and 
686-1-2 (1.2 kb) . The sequence of clone 686-1-2 is 
contained in that of 686-1-4 (sequence 686-1-4), and ranges 

10 from nucleotide 1585 to nucleotide 2793 of SEQ ID NO: 3 
(Figure 7) . 

The present inventors performed 5' RACE using Marathon 
Ready human fetal liver cDNA (Clontech) as the human fetal 
liver cDNA library. The reaction mixture consisted of 5 pi 
15 of the cDNA, 5 pi of 10 x Advantage™ KlenTaq buffer (which 
came with the kit), 4 pi of 2.5 mM dNTP, 1 pi of 10 pM API 
primer (which came with the kit: 

5 ' -CCATCCTAATACGACTCACTATAGGGC-3 ' (SEQ ID NO: 42)), 1 pi of 
10 pM 32-8-1 5' RACE primer #686 

20 (5'-AGCCAGTATCTGATCTCCGACTTTG-3' (SEQ ID NO: 61)), and 1 ml 
of Advantage™ KlenTaq polymerase mix (Toyobo, CLK8417-1) . 
These were mixed with 33 pi of deionized water to make the 
total 50 pi. PCR reaction was performed using a Perkin 
Elmer Thermal Cycler 2400. A reaction consisting of 94 °C 

25 for 1 minute, 5 cycles of 94°C for 5 seconds and 72°C for 
6 minutes, 5 cycles of 94°C for 5 seconds and 70°C for 
6 minutes, and 25 cycles of 94 °C for 5 seconds and 68 °C for 
6 minutes did not produce clearly detectable bands. The 
reaction mixture was then diluted 50 fold, 5 pml of which 

30 was mixed with 5 pml of 10 x Advantage™ KlenTaq buffer 
(which came with the kit), 4 pi of 2.5 mM dNTP, 1 pi of 
10 pM AP2 primer (which came with the kit; 
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5'- ACTCACTATAGGGCTCGAGCGGC-3 (SEQ ID NO: 44)), 1 \il of 
10 jjM 32-8-1 5' RACE nested primer #FLN 

(5' -ATTTTCACTTTAGAAGGGGCACAT-3' (SEQ ID NO: 62)), 1 ]il of 
Advantage™ KlenTaq polymerase mix (Toyobo, CLK8417-1) , and 
5 33 \il of deionized water to make the total 50 pi. Nested 
PCR was performed at 94 °C for 1 minute, 5 cycles of 94 °C 
for 5 seconds and 72°C for 6 minutes, 5 cycles of 94°C for 
5 seconds and 70°C for 6 minutes, and 15 cycles of 94 °C for 
5 seconds and 68 °C for 6 minutes, which produced a band of 

10 approximately 1.1 kb. The products were separated on a 
0.8% agarose gel. The corresponding bands were then 
excised and purified with a QIAquick gel extraction kit 
(QIAGEN, 28706) and were subjected to TA cloning according 
to the manual for the pGEM-T Vector System I (Promega, 

15 A3600) . Three different clones were thus obtained and 
designated HFL#5, HFL#12, and HFL#6. HFL#5 and HFL#12 
started from nucleotide 1357 of SEQ ID NO: 3, while HFL#6 
started from nucleotide 1377 of SEQ ID NO: 3. Of course, 
all three contained the sequence up to that of primer #FLN, 

20 which was used in the RACE (Figure 7) , 

The nucleotide sequences were determined as described 
above, by means of the cycle sequence method using a Dye 
Terminator Cycle Sequencing FS Ready Reaction kit (Perkin 
Elmer, Catalog #402122) . The combined sequence of the 

25 previously determined one and the newly determined one is 
shown in SEQ ID NO: 3. Figure 8 shows the sequences of 
nine PDZ domains aligned. The primers used for the cycle 
sequencing method are listed in Table 3. 
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Table 3 



Primer # DNA sequence SEQ ID NO: 

68 6A GGCATAACTTTACTTACTTG 63 

68 6B ATCTACTAAGTCAGCATCAT 64 

68 6C ATTTGCAGGTGTGTAGTCAT 65 

68 6D TTCCTTCTGTGCTACCCGAT 66 

68 6E GGACTATCTTC C AGAAC AT G 67 



5 Example 4 Search for proteins having homology to the 
protein encoded by the "38-2-1" gene 

The BLASTN search and the BLAST P search detected u Mus 
musculus 90RF binding protein 1 (9BP-1) mRNA, partial cds." 
(LOCUS: MMAF000168, ACCESSION: AF000168) that consists of 

10 2703 bp as a gene having homology to the "32-8-1" gene. 

This gene was recorded toGenBank on 18 MAY 1997. The amino 
acid sequence of the protein encoded by the "32-8-1" gene 
(the sequence after amino acid 847 of SEQ ID NO: 1) and 
that of AF000168 are aligned and shown in Figure 1. In the 

15 figure, the amino acid 847 of SEQ ID NO: 1 was regarded as 
the "first" amino acid, and comparisons are shown with the 
amino acid sequence thereafter. 

In addition, "Rattus norvegius mRNA for multi PDZ 
domain protein" (LOCUS: RNMUPP1, ACCESSION: AJ001320) 

20 consisting of 7516 bp, and "Homo sapiens mRNA for multi PDZ 
domain protein" (LOCUS: HSMUPP1, ACCESSION: AJ001319) 
consisting of 1768 bp were detected as genes having 
homology. These genes were registered on 26 MAR 1998. 
The amino acid sequence of the protein encoded by the 

25 "32-8-1" gene (the sequence after amino acid 921 of SEQ ID 
NO: 1) and that of AJ001319 are aligned and shown in 
Figure 2. The amino acid sequence of the protein encoded 
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by the "32-8-1" gene (the sequence of SEQ ID NO: 1) and 
that of AJ001320 are aligned and shown in Figures 3 and 4. 

Example 5 Analysis of tissue specificity of expression by 
5 northern blotting 

Clontech Human Multiple Tissue Northern (MTN) Blot 
(Catalog #7760-1), Human MTN Blot IV(Catalog #7766-1), 
Human Fetal MTN Blot II (#7756-1), Human Muscle MTN Blot 
(#7765-1) and Human Cancer Cell Line MTN Blot (#7757-1) 

10 were used to analyze the tissue specificity of gene 

expression. Northern blot was performed according to the 
standard method, using the BamHI-Xbal fragment (from 
position 3709 to position 4337 of SEQ ID NO: 3) as the 
probe (see Figure 7 for the position of the probe) , and 

15 25 ng of the DNA fragment was labeled with a- 32 P dCTP using 
a Megaprime DNA labeling kit (Amersham, Catalog RPN1607) . 
These MTN Blots were prehybridized in 5 ml of the 
ExpressHyb hybridization solution (Clontech, Catalog #8015- 
2) at 68°C for 30 minutes, and then hybridized with 1 x 

20 10 7 cpm of the labeled probe also in 5 ml of the ExpressHyb 
hybridization solution (2 x 10 6 cpm/ ml) at 68 °C for 
2 hours. The filters were washed three times in 2 x SSC 
(0.3 M NaCl, 0.03 M sodium citrate (pH 7.0))/ 0.05% SDS at 
room temperature for 10 minutes each, washed twice more in 

25 0.1 x SSC/ 0.1% SDS at 50°C for 15 minutes each, exposed on 
FUJI imaging plates overnight, and analyzed by a FUJI 
BAS2000. As shown in Figure 5, strong expression of the 
approximately 8 kb transcription product was detected in 
the heart, placenta, skeletal muscle, fetal brain, fetal 

30 lung, fetal kidney, small intestine, bladder, stomach, 
prostate, HeLa S3 cells, lung cancer A549 cells, and 
melanoma G361 cells. However, the expression was either 
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absent or weak in the lung, lymphoid tissues (spleen and 
thymus), and cell lines (lanes 1, 3, 4, 5, and 6 of 
blot C) . In the heart, liver, kidney, and fetal liver, a 
5.5 kb transcription product was expressed. 

5 Similarly, northern blot analysis was performed using 

the Ndel 1.2 kb-#l probe (from position 1 to position 1091 
of SEQ ID NO: 3) (see Figure 7 for the position of the 
probe). However, the band corresponding to the 5.5 kb 
transcription product was not detected (Figure 6) . 

10 Considering this and the fact that the cDNAs cloned by 5' 
RACE from the fetal liver only contained the 5' sequences 
1,357 and 1,377 nucleotides downstream from the 5' end of 
the transcription product expressed in the heart 
(Figure 7), it can be deduced that the difference of the 

15 transcription initiation sites in the heart and liver 

caused the difference in the lengths of the transcription 
products. Therefore, the peptide encoded by the 32-8-1 
gene that is expressed in the liver is expected to start 
with the first methionine encoded by the ATG codon 

20 beginning with the 1396 th nucleotide. This results in the 
transcription product from the liver consisting of 1,005 
amino acids, compared with that from the heart consisting 
of 1,373 amino acids. Consequently, it does not contain 
PDZ domain E and is shorter by 368 amino acids. Although 

25 the biological significance of not having PDZ domain E is 
unclear at present, it is highly possible that this 
protein, by lacking this portion, is involved in a 
different signal regulation in the liver cells than in the 
other tissues since PDZ domains are important for protein- 

30 protein interactions 
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Example 6 Expression of the 32-8-1 protein in E. coli 
(1) Construction of the expression vector 

In order to express the 32-8-1 protein in E. coli as a 
fusion protein with the glutathione-S-transf erase (GST) 
5 protein, part of the 32-8-1 gene was ligated to the 

carboxyl terminus of the GST gene in Pharmacia's pGEX-2TK 
(Genbank Accession U13851) . The vector was constructed 
based on the method of W. Dietmaier et al. for the di-/tri- 
nucleotide sticky end cloning described in the PCR 

10 Application Manual (Boehringer Mannheim) . pGEX-2TK (1 pg) 
was reacted in a mixture of 2 ml of 10 x High buffer and 
20 units each of restriction enzymes EcoRI and BamHI in a 
total volume of 20 pi at 37 °C for 3 hours. Proteins were 
removed by using a QIAquick column (QIAGEN) according to 

15 the manual, and the purified DNA was eluted with 30 pi of 

distilled water. 10 x Klenow buffer (3 pi; 100 mM Tris-HCl 
(pH 7.5), 70 mM MgCl 2 , 1 mM DTT) , which comes with Takara' s 
Klenow enzyme, and 1 . 5 pi of 2 mM dGTP were mixed with 
27 pi of the above. Four units of Klenow enzyme were then 

20 added, and the reaction was allowed to proceed at room 
temperature for 15 minutes. After the enzyme was 
inactivated by heating at 75°C for 15 minutes', the DNA was 
purified by deproteinizing with a QIAquick column (QIAGEN) 
according to the manual. 

25 The region of the 32-8-1 gene to be expressed, which 

encodes amino acids 1112 to 1373, was amplified by PCR 
using 50 ng of #32-8-1 DNA as the template. The 
amplification reaction was done by adding 5 pi of 10 x 
Reaction buffer #1 for KOD DNA polymerase (Toyobo) , 5 pi 

30 each of 10 pM primer 502-508 ( 5 ' -ATCGGGTCCATTCCATTCAGAGAGG- 
3' (SEQ ID NO: 68)) and 10 pM primer 758-763E 
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( 5 ' -AATTGTCAAGAGAGAACCATCAAAGTGG-3 ' ( SEQ ID NO: 69)), 4 \xl 
of 2.5 mM dNTP, 2 ]il of 25 mM MgCl 2 , and 27 ]il of sterilized 
water. 2.5 ]il of KOD DNA polymerase was then mixed into 
the solution, which was then incubated at 94 °C for 
5 2 minutes and subjected to 25 cycles of 98 °C for 

15 seconds, 65°C for 2 seconds, and 74 °C for 30 seconds. 
Using the QIAquick PCR purification kit, the 7 98 bp PCR 
product was purified according to the manual. The purified 
PCR fragment (2 was mixed with 7 \il of the Boehringer's 

10 5 x T4 DNA polymerase buffer (330 mM Tris-acetate, pH 8.0; 
660 mM potassium acetate; 100 mM magnesium acetate; and 
5 mM DTT), 1.5 pi of 2 mM dCTP, and 21.5 \il of sterilized 
water. Three units of T4 DNA polymerase were then added, 
and the solution was reacted at 12 °C for 30 minutes. The 

15 present inventors inactivated the enzyme by incubation at 
80 °C for 15 minutes and purified the mixture with the 
QIAquick PCR purification kit according to the manual. The 
pGEX-2TK plasmid, which had been digested with restriction 
enzymes EcoRI and BamHI and modified with Klenow, and the 

20 T4 polymerase-treated PCR product were ligated by one unit 
of T4 DNA ligase (Promega) using the attached buffer (30 mM 
Tris-HCl (pH 7.8), 10 mM MgCl 2 , 10 mM DTT, and 1 mM ATP) at 
15 °C overnight. The reaction mixture was then used to 
transform E. coli DH5 alpha. The recombinant protein 

25 expressed by the transformant was designated GST-PDZ56. 

Similarly, a PCR product encoding the amino acids from 
position 611 to position 1142 of SEQ ID NO: 1 was prepared 
by the above described method with primer 1-7 
( 5 ' -ATCGATGGGTAGTAATCACACACAG-3 7 (SEQ ID NO: 70)) and 

30 primer 527-532E ( 5 9 -AATTGCTATACTGGATCCAGAGAGTGG-3 ' (SEQ ID 
NO: 71)), using clone 32-8-1/5R3 as the template. The 
preparation was treated with T4 polymerase under the same 
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conditions as before, purified with the EcoRI and BamHI- 
digested and Klenow-treated pGEX-2TK, and ligated. The 
reaction mixture was used to transform E. coli DH5 alpha. 
The recombinant protein expressed by the transformant was 
5 designated GST-PDZ14 . 

The E. coli transf ormants that express GST-PDZ56 were 
selected by the following method. Four colonies of the E. 
coli transf ormants obtained from the above were shake- 
cultured at 37 °C overnight in 2 ml of the LB medium (5 g of 

10 Bacto-yeast extract (DIFCO) , 10 g of Bacto-trypton (DIFCO) f 
and 10 g of NaCl made to 1 L by dissolving them in 
distilled water) containing 100 pg/ml ampicillin. The 
solution was then diluted 100 fold in the medium of the 
same composition, and isopropyl thiogalactoside (IPTG) was 

15 added to a final concentration of 0.1 mM. The solution was 
then shake-cultured at 37 °C for 3 hours. A 100 jil sample 
was precipitated by centrifugation at 15,000 rpm for 
10 seconds and analyzed on a 10% to 20% SDS-polyacrylamide 
gel. The subsequent Coomassie staining easily detected the 

20 IPTG-induced expression of the approximately 55 kDa GST 
fusion protein from every transformant (Figure 9) . 
Furthermore, western blotting with an anti-GST antibody 
also confirmed the induced expression of the 55 kDa protein 
band (Figure 10) . For detection, the proteins in the 

25 samples separated on the 10% to 20% SDS-polyacrylamide gel 
were transferred onto Immobilon-P (Millipore) using a 
Semidry blotter (Bio-Rad) according to the methods 
described in the manual. The filter was then blocked at 
4°C overnight with 5% skim milk (DIFCO), 2.5% bovine serum 

30 albumin (Sigma, A5940) , and T-TBS (20 mM Tris-HCl (pH 7.5), 
150 mM NaCl, and 0.05% Tween 20). It was next reacted at 
room temperature for 1 hour with the anti-sheep GST 
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antibody (Pharmacia) diluted 1,000 fold in the antibody 
dilution buffer (1% skim milk, 0.5% bovine serum albumin, 
T-TBS), then reacted at room temperature for 1 hour with 
the alkaline phosphatase-labeled anti-sheep IgG antibody 
diluted 1,000 fold in the antibody dilution buffer. 
Finally, the protein was detected by a GST Detection Module 
(Pharmacia) . 

The GST-PDZ14 was similarly expressed. However, 
E. coli HB101 and JM109 were used as hosts because E.coli 
DH5 did not produce an efficient IPTG-induced expression. 
The results shown in Figure 11 indicate that E. coli HB101 
did not produce very large amounts of expression products, 
but that the GST-PDZl4-derived band near 90 kDa was highly 
induced in E.coli JM109. E. coli JM109 was subsequently 
used to express and purify the fusion protein. 

(2) Expression and purification of the GST 32-8-1 fusion 
protein 

The present inventors followed the method for 
preparing fusion proteins described on page 217 of the 
Supplement to Jikken Igaku (Experimental Medicine) "Shin 
Idenshi Kogaku Handbook (New Genetic Engineering Handbook) 
edited by Masami Muramatsu, et. al." published by Yohdosha 
to express and purify the GST fusion proteins. GST-PDZ14 
and GST-PDZ56 were each cultured at 37 °C for 1 hour in 2 L 
of LB medium to which IPTG was added to achieve a final 
concentration of 0.1 mM, then shake-cultured at 25°C for 
5 hours. The cells were collected at 7,000 rpm for 
10 minutes, resuspended in a sonication buffer consisting 
of PBS and 1% Triton X-100, and sonicated 1 minute for five 
times while chilling. The supernatant was obtained by 
centrifugation at 10,000 rpm for 15 minutes. It was then 
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applied onto a glutathione-Sepharose column, washed well 
with PBS, and purified with the GST Purification Module 
elution buffer (Pharmacia Biotech) . 

The 32-8-1 gene was inserted into the multi-cloning 
5 site of the pGEX-2TK expression vector (Pharmacia) . This 
vector has a region encording the string of amino acids 
"Leu-Val-Pro-Arg-Gly-Ser" recognized by thrombin protease 
in frame after GST protein gene, so the GST protein portion 
can be separated from GST-32-8-1 fusion protein by applying 

10 the thrombin protease that recognizes this sequence and 
digesting the protein. This is useful for preparing 
antibodies against the protein encoded by the 32-8-1 gene 
(the 32-8-1 protein) . Because the glutathione-Sepharose 
column binds to the GST protein, it was possible to purify 

15 the PDZ14 and PDZ56 portions only as the fractions not 

binding to the glutathione-Sepharose (Figures 12, 13, and 
14) by applying the protein solution that had been digested 
with the thrombin protease onto the glutathione-Sepharose 
column. The 55 kDa GST-PDZ56 protein bands seen in 

20 Figure 12 (lanes 11 and 12) were digested by thrombin into 
the 25 kDa GST protein and the 30 kDa PDZ56 protein 
(lane 10) . Furthermore, the results of western blotting 
using the anti-GST antibody indicated that the anti-GST 
antibody reacted only with the 55 kDa and 25 kDa proteins, 

25 both of which contained the GST protein (Figure 13) . 

Together, these confirmed that the PDZ56 protein portion 
was cut off as the 30 kDa band (lanes 8 and 9) . Similarly 
to GST-PDZ14, the 90 kDa GST-PDZ14 can be separated by 
thrombin digestion into the 25 kDa GST protein and the 65 

30 kDa PDZ14 protein portion, as shown in Figure 14. 

Therefore, the protein was purified according to the 
following procedure. The present inventors used the method 
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described in the Supplement to Jikken Igaku (Experimental 
Medicine) "Shin Idenshi Kogaku Handbook (New Genetic 
Engineering Handbook) edited by Masami Muramatsu, et. al" 
published by Yohdosha to culture E. coli cells. They then 
5 used the supernatant of the sonicated cells to digest the 
proteins by thrombin. The detailed method followed the 
procedure described under Thrombin Cleavage on page 16 of 
the GST Gene Fusion System (Pharmacia) . They next added 
10 ]il (10 cleavage units) of thrombin per 1 mg of the 

10 fusion protein and incubated the mixture at room 

temperature for 16 hours to separate the PDZ14 protein or 
the PDZ56 protein portion from the GST portion. By letting 
the cleaved GST protein portion bind to the glutathione- 
Sepharose column (Pharmacia), they recovered 0.56 mg of the 

15 PDZ14 or 3.5 mg of the PDZ56 protein portion as the flow- 
through from the column. 

(3) Preparation of polyclonal antibodies using the antigens 
expressed in E. coli 

20 The present inventors obtained polyclonal antibodies 

by immunizing two rabbits each with the purified PDZ14 or 
PDZ56 antigen. The initial immunization was done by 
subcutaneously injecting 0.5 mg of PDZ56 or 0.22 mg of 
PDZ14 bound with the carrier protein per animal as an 

25 antigen emulsion mixed with an equal amount of Freund' s 
complete adjuvant (FCA) by the standard method. Booster 
injections of 0.25 mg of PDZ56 or PDZ14 were given 
subcutaneously as an antigen emulsion mixed with an equal 
amount of Freund' s incomplete adjuvant (FICA) three times 

30 at two-week intervals. The proteins used as antigens were 
separated by SDS-PAGE and transferred onto a PVDF membrane 
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(Immobilon-P, Millipore) ; the reactivity was confirmed by 
western blotting. 

(4) Preparation of polyclonal antibodies using a peptide 
5 Iwaki Glass synthesized Peptide 32-8-1-17 (SEQ ID 

NO: 72) consisting of 21 amino acids under contract. The 
Keyhole limpet hemocyanin (KLH) protein was coupled to the 
peptide as a carrier protein by the Sulfo-MBS method, and 
two rabbits were immunized with the product. The initial 

10 immunization was done by subcutaneously injecting 0.4 mg of 
the 32-8-1-17 peptide bound with the carrier protein per 
animal as an antigen emulsion mixed with an equal amount of 
Freund's complete adjuvant (FCA) by the standard method. 
The second through the fifth immunizations were given at 

15 two-week intervals by subcutaneously injecting 0.2 mg of 

the 32-8-1-17 peptide bound with the carrier protein as an 
antigen emulsion mixed with an equal amount of Freund' s 
incomplete adjuvant (FICA) . The antibody titers were 
measured using an ELISA plate coated with the 32-8-1-17 

20 peptide; the antisera were obtained when the titer had 
risen sufficiently. 

(5) Reactivity of the polyclonal antibodies 

Antisera were obtained by immunizing the rabbits with 
25 peptide 32-8-1-17 or with PDZ14 or PDZ56 protein expressed 
as a GST-fusion protein and then digested with thrombin to 
retain only the 32-8-1 gene product. The reactivity of the 
antisera was detected by western blotting using Protein 
Medley manufactured by Clontech. More specifically, 100 jig 
30 each of the cell lysates of the tissues from the human 

Testis (T) , Skeletal Muscle (Sk) , Liver (Lv) , Heart (H) , 
and Brain (B) of the Protein Medley manufactured by 
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Clontech was separated on a 10% to 20% SDS-polyacrylamide 
gel. The cell lysates were then transferred onto 
Immobilon-P (Millipore) using the Semidry blotter (Bio-Rad) 
according to the methods described in the manual. The 
5 filter was then blocked at 4°C overnight with 5% skim milk 
(DIFCO), 2.5% bovine serum albumin (Sigma, A5940) , and 
T-TBS (20 mM Tris-HCl (pH 7.5), 150 mM NaCl, and 0.05% 
Tween 20) . The filter was next reacted at room temperature 
for 1 hour with the individual rabbit antisera diluted 

10 5,000 fold in the antibody dilution buffer (1% skim milk, 

0.5% bovine serum albumin, and T-TBS). It was then reacted 
at room temperature for 1 hour with the biotin-labeled 
anti-rabbit Ig diluted 1,000 fold in the antibody dilution 
buffer. Finally, the filter was further reacted at room 

15 temperature for 15 minutes with the horseradish peroxidase 
(HRP) -labeled streptavidin-biotin complex (Amersham) 
diluted 2,500 fold in the antibody dilution buffer and 
washed well with T-TBS. The reacting bands were detected 
by chemiluminescence using an ECL detection kit (Amersham) 

20 according to the manual. Consequently, as shown in 

Figure 15, a band that reacted with every antibody and is 
presumed to be derived from the 32-8-1 protein was detected 
in the liver tissue sample near the 130 kDa. 

25 Example 7 Cloning of an upstream cDNA of 686-1-4 by RACE 

The present inventors attempted to obtain a cDNA clone 
5' upstream of 686-1-4, which was cloned from the human 
heart, by the 5' Rapid Amplification of cDNA Ends (RACE) 
method. The details follow. 

30 
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(1) Cloning of cDNA clone D-2 by RACE 

The present inventors performed 5' Rapid Amplification 
of cDNA Ends (RACE) by using the Marathon Ready human brain 
cDNA (Clontech, #7400-1) as the adult human brain cDNA 
5 library. The 5 ' RACE mixture consisted of 5 pi of the 

Marathon Ready adult human brain cDNA, 1 jil each of 10 yiM 
primer #878 ( 5 9 -TTTGTGCCCACCAGAGCCAAGTCAG-3 ' (SEQ ID 
NO: 73)) and 10 pM API primer (which came with the kit: 
5' -CCATCCTAATACGACTCACTATAGGGC-3' (SEQ ID NO: 42)), 1 ]il of 

10 Advantage™ KlenTaq polymerase mix (Toyobo, CLK8417-1) , and 
33 ]il of deionized water to make the total 50 pi. A PCR 
reaction using the Thermal Cycler (95°C for 1 minute, five 
cycles of 95°C for 5 seconds and 72°C for 4 minutes, five 
cycles of 95°C for 5 seconds and 70°C for 4 minutes, and 

15 25 cycles of 95°C for 5 seconds and 68°C for 4 minutes) did 
not produce clearly detectable bands. Therefore, the 
reaction mixture was diluted 50 fold, and 5 \il of this was 
mixed with 5 ]il of 10 x Advantage™ KlenTaq buffer (which 
came with the kit), 4 \il of 2,5 mM dNTP, 1 ]il of 10 ]iM AP2 

20 primer (which came with the kit; 

5' ACTCACTATAGGGCTCGAGCGGC-3 (SEQ ID NO: 44)), 1 ]il of 
10 iaM 32-8-1 5' RACE nested primer #757 

(5' -GTGAAAGGGGTAAAGGCTTAGCAAC-3' (SEQ ID NO: 74)), 1 \xl of 
Advantage™ KlenTaq polymerase mix (Toyobo, CLK8 417-1) , and 

25 33 ]il of deionized water to make the total 50 ial. Nested 

PCR was performed at 95°C for 1 minute, five cycles of 95°C 
for 5 seconds and 72 °C for four minutes, and five cycles of 
95 °C for 5 seconds and 70 °C for 4 minutes. Subsequent 
treatment of 15 cycles at 95°C for 5 seconds and 68 °C for 

30 4 minutes produced a band of 1.8 kb. The products were 
separated on a 0.8% agarose gel. The corresponding band 
was excised and purified with the QIAquick gel extraction 
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kit (QIAGEN, 28706) and subjected to TA cloning according 
to the manual for the pGEM-T Vector System I (Promega, 
A3600) • The resulting clone was designated D-2. 

The nucleotide sequence determination was done as 
5 previously described, and the nucleotide sequence of 1,77 6 
base pairs was determined by means of the cycle sequence 
method using the Dye Terminator Cycle Sequencing FS Ready 
Reaction kit (Perkin Elmer, Catalog #402122) . The 
nucleotide sequence thus determined was found to encode 590 
10 amino acids (SEQ ID NO: 75) . 

(2) Cloning of cDNA clone 1.2 kb#33 by RACE 

The open reading frame that exists within the sequence 
of Clone D-2 is a sequence upstream of the 781 st nucleotide 

15 of SEQ ID NO: 3. This open reading frame is not closed, 
that is, it does not have a stop codon, so it was assumed 
that the open reading frame continues further upstream. 
Thus the present inventors prepared a new primer and 
performed 5' RACE. By using the Marathon Ready human brain 

20 cDNA (Clontech, #7400-1) as the template, the present 

inventors performed 5' Rapid Amplification of cDNA Ends 
(RACE). The 5' RACE mixture consisted of 5 ]il of the 
Marathon Ready adult human brain cDNA, 1 \il each of 10 yM 
primer B5R-1 ( 5 ' -GCAGATGGAGAACGGGAAACTATGG-3 ' (SEQ ID 

25 NO: 7 6) ) and 10 mM API primer (which came with the kit; 

5 ' -CCATCCTAATACGACTCACTATAGGGC-3 ' (SEQ ID NO: 42)), 5 ]il of 
10 x Advantage™ KlenTaq buffer (which came with the kit) , 

4 )il of 2.5 mM dNTP, 1 pi of Advantage™ KlenTaq polymerase 
mix (Toyobo, CLK8417-1), and 33 ]il of deionized water to 

30 make a total of 50 pi. A PCR reaction using the Thermal 
Cycler (95°C for 1 minute, five cycles of 95°C for 

5 seconds and 72 °C for 3 minutes, and five cycles of 95°C 
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for 5 seconds and 70 °C for 3 minutes, followed by 25 cycles 
of 95 °C for 5 seconds and 68 °C for 3 minutes) did not 
produce detectable bands. Therefore, the reaction mixture 
was diluted 50 fold, and 5 ]al of this was used as the 
5 template to perform nested PCR. The template was mixed 
with 5 pi of 10 x Advantage™ KlenTaq buffer (which came 
with the kit), 4 pi of 2,5 mM dNTP, 1 ]il of 10 ]M AP2 
primer (which came with the kit; 

5' -ACTCACTATAGGGCTCGAGCGGC-3 (SEQ ID NO: 44)), 1 ]il of 

10 10 \iM primer B5R-2 (5' -GAACGGGAAACTATGGGGCTGACAA-3' (SEQ ID 
NO: 77)) , 1 ]il of Advantage™ KlenTaq polymerase mix 
(Toyobo, CLK8417-1), and 33 \il of deionized water to make 
the total 50 A reaction consisting of 95°C for 

1 minute, five cycles of 95°C for 5 seconds and 72°C for 

15 3 minutes, five cycles of 95°C for 5 seconds and 70°C for 
3 minutes, followed by 15 cycles of 95°C for 5 seconds and 
68 °C for 3 minutes produced a band of 0.8 kb. The products 
were separated on a 0.8% agarose gel, the corresponding 
band was excised and purified with the QIAquick gel 

20 extraction kit (QIAGEN, 28706) . It was then subjected to 
TA cloning according to the manual for the pGEM-T Vector 
System I (Promega, A3600) . The resulting clone was 
designated 1.2 kb#33. The results of the nucleotide 
sequence determination (conducted as described before) 

25 revealed that the ATG codon starting with the 71 st 

nucleotide corresponded to the first methionine, and that 
the clone encoded 235 amino acids. The last amino acid, 
arginine, corresponded to the arginine encoded by the 
nucleotides from position 108 to position 110 of clone D-2, 

30 and the nucleotides from position 1 to position 110 of the 
clone D-2 sequence overlapped with clone 1.2 kb#33. 
Therefore, the present inventors concluded that all the 
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upstream sequences of the deduced open reading frame were 
cloned. 

(3) Analysis of the difference in amounts of expression in 
5 tissues 

Amounts of mRNA expression in 24 types of 
tissues, including those that were examined by the northern 
blot above, were compared by RT-PCR. Human MTC panel I 
(K1402-1), human MTC panel II (K1421-1), and human fetal 

10 MTC panel I (K1425-1) , which are commercially available 

from Clontech, were used as the cDNA. Results of the PCR 
reaction with the following reaction conditions are shown 
in Figure 16-A. PreMixTaq (10 |il; ExTaq TM Version) 
(Takara, PR003A) , 2 pi of 2 \iM 686D primer (SEQ ID NO: 66, 

15 corresponding to positions 2970 to 2989 of SEQ ID NO: 83, 
and to positions 1 to 20 of Figure 17), 2 ]jl of 2 pM 686E 
primer (SEQ ID NO: 67, corresponding to positions 3635 to 
3654 of SEQ ID NO: 83, and to positions 666 to 685 of 
Figure 17), 1 pi of the first strand cDNA, and 5 \il of 

20 deionized water were mixed to make a total of 20 \il and 

reacted. Reaction of 94 °C for 5 minutes, and 30 cycles of 
a three-step PCR reaction consisting of 94 °C for 
15 seconds, 50°C for 30 seconds, and 72°C for 30 seconds, 
followed by an extension reaction of 72 °C for 7 minutes 

25 were performed. 

As a result, high degrees of expression were seen in 
the brain (1), placenta (7), skeletal muscle (8), 
ovary (10), spleen (14), testis (15), fetal heart (18), 
fetal kidney (19), and fetal skeletal muscle (22). In 

30 addition, 15 \il of PreMixTaq (ExTaq TM Version) (Takara, 

PR003A), 3 |il of 2 ]M 686D primer (SEQ ID NO: 66), 3 \il of 
2 |iM XE primer (SEQ ID NO: 15, which corresponds to 
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positions 3915 to 3934 of SEQ ID NO: 83, and to positions 
946 to 965 of Figure 17), 1 \xl of the first strand cDNA, 
and 8 pi of deionized water were mixed to make a total of 
30 ]il and reacted. Reaction of 94 °C for 5 minutes, and 
5 30 cycles of a three-step PCR reaction consisting of 94 °C 
for 15 seconds, 50°C for 30 seconds, and 72°C for 1 minute, 
followed by an extension reaction of 72 °C for 7 minutes 
were performed. As shown in Figure 16-B, bands of 750 bp, 
850 bp, and 950 bp, which are presumed to be derived from 

10 three types of transcription products, were detected. 

Since the experiments in Figure 16-A and 16-B both used the 
same 686D primer, three types of splicing should take place 
within the sequences that exist in between the 686E primer 
and the XE primer, producing the transcription products of 

15 different lengths. The three types of PCR products were 
cloned from those of the fetal heart that showed the 
highest expression. Cloning was done by excising the 
corresponding bands and purifying them using the QIAquick 
gel extraction kit (QIAGEN, 28706), and by following the 

20 manual for the pGEM-T Vector System I (Promega, A3600) . 

(4) Analysis of the gene sequences of clones FH750, FH850, 
and FH950 

The nucleotide sequences for the cloned PCR products 
25 were determined according to the method described above. 
The determined nucleotide sequences of FH750, FH850, and 
FH950 are shown in SEQ ID NO: 79, 80, and 81, respectively. 
The sequences of the three kinds of DNA are aligned and 
shown in Figures 17 and 18. Although the sequences of the 
30 three kinds of DNA are identical up to sequence position 
731, FH850 diverges from FH950 beginning with the 819 th 
nucleotide, which suggests that splicing takes place at 
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some sequence immediately preceding this position. As a 
result of this splicing, FH850 generates a stop codon with 
the sequence from position 819 to position 821, and the 
translation is expected to terminate at this position. 
5 In FH750, positions 732 to 941 of the FH950 sequence 

are spliced out, creating a 210 bp deletion of the gene. 
However, the protein encoded by the transcription product 
of the FH750 type splicing is predicted to lack 70 amino 
acids compared with FH950 type splicing since the sequences 

10 before and after the deleted region are expected to be 
translated in the same frame as in those of FH950. 

The sequence obtained by combining 1.2 kb#33 (SEQ ID 
NO: 78), D-2 (SEQ ID NO: 75), and SEQ ID NO: 3 (Figure 26) 
was bordered by primer 68 6D and primer XE derived from the 

15 686-1-4 sequence and was identical to FH750. The clone 

corresponding to the transcription product expected to be 
generated by the FH750 type splicing was designated 32-8- 
la. (The amino acid sequence of the protein is shown in 
SEQ ID NO: 82, and the nucleotide sequence of the cDNA is 

20 shown in SEQ ID NO: 85.) Clone 32-8-la can code for 2,000 
amino acids. The clone corresponding to the transcription 
product expected to be generated by the FH950 type splicing 
was designated 32-8-lb. (The amino acid sequence of the 
protein is shown in SEQ ID NO: 83, and the nucleotide 

25 sequence of the cDNA is shown in SEQ ID NO: 86.) Code 

32-8-lb can code for 2,070 amino acids. These two genes 
possess 13 PDZ domains. Furthermore, the transcription 
product generated by the FH850 type splicing will contain a 
stop codon in this region, and it can only code for 1,239 

30 amino acids. This means that it possesses only seven PDZ 
domains. The clone corresponding to this transcription 
product was designated 32-8-lc. (The amino acid sequence 
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of the protein is shown in SEQ ID NO: 84, and the 
nucleotide sequence of the cDNA is shown in SEQ ID NO: 87.) 

Sequence comparisons between "32-8-lb" and "AF00168" 
(Mus musculus 90RF binding protein 1 (9BP-1) mRNA, partial 
5 cds.) are shown in Figure 20; between "32-8-lb" and 
"AJ001319" (Homo sapiens mRNA for multi PDZ domain 
protein) , in Figure 21; and between "32-8-lb" and 
"AJ001320" (Rattus norvegius mRNA for multi PDZ domain 
protein), in Figures 22 through 24, The PDZ domain 
10 sequences of the protein encoded by the 32-8-lb gene (SEQ 
ID NO: 83) are also shown in Figure 25. 

(6) Identification of the 32-8-lb high molecular weight 
protein by western blotting 

15 Human neuroblastoma SH-SY5Y cells and human 

teratocarcinoma NT-2 cells stimulated by retinoic acid to 
differentiate into neurons were directly dissolved into 
SDS-PAGE sample buffer and separated on a 7.5% SDS- 
polyacrylamide gel by electrophoresis. The proteins were 

20 transferred onto Immobilon-P (Millipore) using the Semidry 
blotter (Bio-Rad) according to the methods described in the 
manual. The filter was then blocked at 4°C overnight with 
5% skim milk (DIFCO) , 2.5% bovine serum albumin (Sigma, 
A5940), and T-TBS (20 mM Tris-HCl (pH 7.5), 150 mM NaCl, 

25 and 0.05% Tween 20). It was next reacted at room 

temperature for 1 hour with the individual rabbit antisera 
diluted 5,000 fold in the antibody dilution buffer (1% skim 
milk, 0.5% bovine serum albumin, and T-TBS). It was then 
reacted at room temperature for 1 hour with the biotin- 

30 labeled anti-rabbit Ig diluted 1,000 fold in the antibody 
dilution buffer. Finally, it was reacted at room 
temperature for 15 minutes with the horseradish peroxidase 
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(HRP) -labeled streptavidin-biotin complex (Amersham) 
diluted 2,500 fold in the antibody dilution buffer and 
washed well with T-TBS. The reacting bands were then 
detected by chemiluminescence using an ECL detection kit 
5 (Amersham) according to the manual. As shown in Figure 19, 
a protein whose molecular weight exceeds 250 kDa was 
detected in both SH-SY5Y and NT-N with either rabbit 
antisera #1 raised against peptide 32-8-1-17 or rabbit 
antisera #3D raised against PDZ56 that had been expressed 
10 as a GST fusion protein and digested with thrombin to 

retain only the 32-8-1 gene product. The assumption that 
the full-length 32-8-lb protein consists of 2,070 amino 
acids agrees with the molecular weight observed. 

15 Industrial Applicability 

By utilizing the proteins and the gene of the present 
invention, it has become possible to isolate the proteins 
and their genes that bind to the PDZ domains of the 
proteins of the present invention. It has been reported 

20 that proteins having the PDZ domain interact with the 
proteins that bind to them and function in the signal 
transduction related to cell proliferation, cell cycle, 
malignant conversion, apoptosis, cell adhesion, etc. 
Therefore, if the relationships between the proteins of the 

25 present invention and the proteins that interact with them, 
as well as the related signal transduction pathways, can be 
clarified, it should be possible to treat and diagnose 
disorders related to cell proliferation and others 
described above by targeting these proteins or their genes. 

30 These proteins and their genes are therefore useful for 

developing therapeutic medicines and diagnostic medicines. 
What is claimed is: 
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1 1 . A substantially pure polypeptide comprising an 

2 amino acid sequence at least 85% identical to any one of 

3 SEQ ID NOs:l, 2, 82, 83, or 84, wherein the polypeptide 

4 contains a PDZ domain sequence. 

1 2. The polypeptide of claim 1, wherein the amino acid 

2 sequence is at least 90% identical to any one of SEQ ID 

3 NOs: 1, 2, 82, 83, or 84. 

1 3. A substantially pure polypeptide comprising the 

2 sequence of any one of SEQ ID NOs: 1, 2, 82, 83, or 84. 

1 4. A substantially pure polypeptide comprising the 

2 amino acid sequence of any one of SEQ ID NOs: 1, 2, 82, 83, 

3 or 84, with up to 50 conservative amino acid substitutions, 

4 wherein the polypeptide contains a PDZ domain sequence. 

1 5. A substantially pure polypeptide encoded by a 



2 nucleic acid that hybridizes under high stringency 

3 conditions to a probe the sequence of which consists of any 

4 one of SEQ ID NOs:3, 59, 75, 78, 79, 80, 81, 85, 86, or 87, 

5 wherein the polypeptide contains a PDZ domain sequence. 



1 6. An isolated nucleic acid encoding the polypeptide 

2 of claim 1. 

1 7 . An isolated nucleic acid encoding the polypeptide 

2 of claim 3. 

1 8 . An isolated nucleic acid encoding the polypeptide 

2 of claim 4. 
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1 9 . An isolated nucleic acid comprising a strand that 

2 hybridizes under stringent conditions to a single stranded 

3 probe, the sequence of which consists of any one of SEQ ID 

4 NOs: 3, 59, 75, 78, 79, 80, 81, 85, 86, or 87, or the 

5 complement of any one of SEQ ID NOs: 3, 59, 75, 78, 79, 80, 

6 81, 85, 86, or 87. 



1 10. The isolated nucleic acid of claim 9, wherein the 

2 nucleic acid encodes a polypeptide that contains a PDZ 

3 domain. 

1 11. The nucleic acid of claim 10, wherein the amino 

2 acid sequence of the polypeptide comprises any one of SEQ 

3 ID NOs:l, 2, 82, 83, or 84. 

1 12. The nucleic acid of claim 9, wherein the strand 

2 is at least 15 nucleotides in length. 

1 13. The nucleic acid of claim 12, wherein the nucleic 

2 acid is an antisense nucleic acid that inhibits expression 

3 of a polypeptide comprising any one of SEQ ID NOs:l, 2, 82, 

4 83, or 84. 

1 14. A vector comprising the nucleic acid of claim 6. 

1 15. A vector comprising the nucleic acid of claim 7. 

1 16. A vector comprising the nucleic acid of claim 8. 

1 17. A vector comprising the nucleic acid of claim 9. 

1 18. A vector comprising the nucleic acid of claim 10. 
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1 19. A cultured host cell comprising the nucleic acid 

2 of claim 6. 

1 20. A cultured host cell comprising the nucleic acid 

2 of claim 7. 

1 21. A cultured host cell comprising the nucleic acid 

2 of claim 8. 

1 22. A cultured host cell comprising the nucleic acid 

2 of claim 9. 

1 23. A cultured host cell comprising the nucleic acid 

2 of claim 10. 

1 24. An antibody that specifically binds to the 

2 polypeptide of claim 1. 

1 25. A method of producing a polypeptide, the method 

2 comprising isolating the polypeptide from the cultured host 

3 cell of claim 19. 

1 26. A method of screening for a compound that 

2 specifically binds to a polypeptide, the method comprising 

3 contacting a test compound with the polypeptide of claim 1, 

4 and comparing the extent to which the test compound binds 

5 to the polypeptide with the extent to which a reference 

6 compound binds to the polypeptide, wherein a test compound 

7 binding to the polypeptide to a greater extent than the 

8 reference compound indicates that the test compound 

9 specifically binds to the polypeptide. 
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1 27. The method of claim 26, wherein the test compound 

2 is a test polypeptide, 

1 28. The method of claim 27, further comprising 

2 identifying the gene that encodes the test polypeptide. 

1 29. A compound that binds to the polypeptide of 

2 claim 1. 

1 30. The compound of claim 29, wherein the compound is 

2 a polypeptide. 

1 31. A gene encoding the compound of claim 30. 

1 32. The nucleic acid of claim 12, wherein the nucleic 

2 acid is an antisense nucleic acid that inhibits expression 

3 of a polypeptide comprising any one of SEQ ID NOsrl, 2, 82, 

4 83, or 84. 

1 33. A fusion protein comprising any one of SEQ ID 

2 N0s:l, 2, 82, 83, or 84 and another amino acid sequence. 

1 34. The fusion protein of claim 33, wherein the other 

2 amino acid sequence is specifically bound by an antibody. 
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PROTEIN HAVING PDZ DOMAIN SEQUENCE 
Abstract of the Disclosure 

While analyzing changes in gene expression by 
TNFa in human umbilical vascular endothelial cells, a gene 
showing enhanced expression due to stimulation with TNFa 
was isolated. After screening with this gene as a probe, a 
gene encoding a protein was isolated. Analysis of the 
protein encoded by this isolated gene revealed that this 
novel protein has never been reported and has a PDZ domain 
in its molecule that plays an important role in protein- 
protein interactions . 

20025339.doc 
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Figure 1 
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921 ATCPI I PGCETTI EI SKGRTGLGLS I VGGSDTLLGAFI IHEVYEEGAACK 970 
1 ATCPIIPGCETTIEISKGRTGLGLSIVGGSDTLLGAII IHEVYEEGAACK 50 

971 DGRLWAGOQILEVNGIOLRKATHDEAINVLRQTPQRVRLTLYROEAPYKE 1020 

51 DGRLWAGIX3ILEVNGIDLRKATHDEAINVLRQTPQRVRLTLYRDEAPYKE 100 
1021 EEVCDTLTIELQKKPGKGLGLSIVGKRNDTGVFVSDIVKGGIADPDGRLI 1070 

101 EEVCDTLTIELQKKPGKGLGLSIVGKRNDTGVFVSDIVKGGIAOADGRLM 150 
1071 QGDQILLVNGEDVRNASQEAVAALLKCSLGTVTLEVGRIKAGPFH8ERRP 1120 

151 QGDQILHVNGEDVRNATQEAVAALLKCSLGTVTLEVGRIKAGPFHSERRP 200 
1121 SQTSQVSEGSLSSFTFPLSGSSTSESLESSSKKNALASEIQGLRTVEMKK 1 1 70 

201 SQSSQVSEGSLSSFTFPLSGSSTSESLESSSKKNALASEIQGLRTVEMKK 250 
1171 GPTDSLGISIAGGVGSPLGDVPIFIAMMHPTGVAAQTQKLRVGDRIVTIC 1220 

251 GPTDSLGISIAGGVGSPLGDVPIFIAMMHPTGVAAQTQKLRVGDRIVTIC 300 
1221 GTSTEGMTHTQAVNLLKNASGSIEMQWAGGDVSVVTGHHQEPASSSLSF 1270 

301 GTSTEGMTHTQAVNLLKNASGSIEMQWAGGDVSWTGHQQEPASSSLSF 350 
1271 TGLTSTSI FQODLGPPQCKSI TLERGPOGLGFSI VGGYGSP HGOLP I YVK 1320 

351 TGLTSSS I FQODLGPPQCKSI TLER6PDGLGFS I VGGYGSP HGOLP I YVK 400 
1321 TVFAKGAASEDGRLKRGDQIIAVNGQSLEGVTHEEAVAILKRTKGTVTLM 1370 

401 TVFAKGAASEDGRLKRGDQIIAVNGQSLEGVTHEEAVAILKRTKGTVTLM 450 

1371 VLS 1373 
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451 VLS 453 
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Figure 17 
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SEQUENCE LISTING 

<110> CHUGAI RESEARCH INSTITUTE FOR MOLECULAR MED I C INE , INC . 

<120> PROTEIN HAVING PDZ DOMAIN SEQUENCE 

<130> C2-905DP1PCT 

<140> 
< 141 > 

<150> JP 09-230356 

JP 10-189944 
< 15 1 > 1997-8-12 

1998-6-19 
<160> 87 

< 1 70 > Patentln version 2.0 

<210> 1 
<211> 1373 
<212> PRT 

< 2 1 3 > Homo sapience 



<400> 1 
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Cys Cys Arg Arg Thr Val Pro Pro Thr Thr Gin Ser Glu Leu Asp Ser 

5 10 15 

Leu Asp Leu Cys Asp He Glu Leu Thr Glu Lys Pro His Val Asp Leu 

20 25 30 

Gly Glu Phe He Gly Ser Ser Glu Thr Glu Asp Pro Val Leu Ala Met 
35 40 45 50 

Thr Asp Ala Gly Gin Ser Thr Glu Glu Val Gin Ala Pro Leu Ala Met 

55 60 65 

Trp Glu Ala Gly He Gin His He Glu Leu Glu Lys Gly Ser Lys Gly 

70 75 80 

Leu Gly Phe Ser He Leu Asp Tyr Gin Asp Pro He Asp Pro Ala Ser 

85 90 95 

Thr Val He He He Arg Ser Leu Val Pro Gly Gly He Ala Glu Lys 

100 105 110 

Asp Gly Arg Leu Leu Pro Gly Asp Arg Leu Met Phe Val Asn Asp Val 
115 120 125 130 

Asn Leu Glu Asn Ser Ser Leu Glu Glu Ala Val Glu Ala Leu Lys Gly 

135 140 145 

Ala Pro Ser Gly Thr Val Arg He Gly Val Ala Lys Pro Leu Pro Leu 

150 155 160 

Ser Pro Glu Glu Gly Tyr Val Ser Ala Lys Glu Asp Ser Phe Leu Tyr 

165 170 175 

Pro Pro His Ser Cys Glu Glu Ala Gly Leu Ala Asp Lys Pro Leu Phe 

180 185 190 

Arg Ala Asp Leu Ala Leu Val Gly Thr Asn Asp Ala Asp Leu Val Asp 
195 200 205 210 

Glu Ser Thr Phe Glu Ser Pro Tyr Ser Pro Glu Asn Asp Ser lie Tyr 
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215 220 225 

Ser Thr Gin Ala Ser He Leu Ser Leu His Gly Ser Ser Cys Gly Asp 

230 235 240 

Gly Leu Asn Tyr Gly Ser Ser Leu Pro Ser Ser Pro Pro Lys Asp Val 

245 250 255 

He Glu Asn Ser Cys Asp Pro Val Leu Asp Leu His Met Ser Leu Glu 

260 265 270 

Glu Leu Tyr Thr Gin Asn Leu Leu Glu Arg Gin Asp Glu Asn Thr Pro 
275 280 285 290 

Ser Val Asp He Ser Met Gly Pro Ala Ser Gly Phe Thr He Asn Asp 

295 300 305 

Tyr Thr Pro Ala Asn Ala He Glu Gin Gin Tyr Glu Cys Glu Asn Thr 

310 315 320 

He Val Trp Thr Glu Ser His Leu Pro Ser Glu Val He Ser Ser Ala 

325 330 335 

Glu Leu Pro Ser Val Leu Pro Asp Ser Ala Gly Lys Gly Ser Glu His 

340 345 350 

Leu Leu Glu Gin Ser Ser Leu Ala Cys Asn Ala Glu Cys Val Met Leu 
355 360 365 , 370 

Gin Asn Val Ser Lys Glu Ser Phe Glu Arg Thr He Asn He Ala Lys 

375 380 385 

Gly Asn Ser Ser Leu Gly Met Thr Val Ser Ala Asn Lys Asp Gly Leu 

390 395 400 

Gly Met lie Val Arg Ser He lie His Gly Gly Ala He Ser Arg Asp 

405 410 415 

Gly Arg He Ala He Gly Asp Cys He Leu Ser He Asn Glu Glu Ser 
420 425 430 
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Thr He Ser Val Thr Asn Ala Gin Ala Arg Ala Met Leu Arg Arg His 
435 440 445 450 

Ser Leu He Gly Pro Asp He Lys lie Thr Tyr Val Pro Ala Glu His 

455 460 465 

Leu Glu Glu Phe Lys He Ser Leu Gly Gin Gin Ser Gly Arg Val Met 

470 475 480 

Ala Leu Asp lie Phe Ser Ser Tyr Thr Gly Arg Asp He Pro Glu Leu 

485 490 495 

Pro Glu Arg Glu Glu Gly Glu Gly Glu Glu Ser Glu Leu Gin Asn Thr 

500 505 510 

Ala Tyr Ser Asn Trp Asn Gin Pro Arg Arg Val Glu Leu Trp Arg Glu 
515 520 525 530 

Pro Ser Lys Ser Leu Gly He Ser He Val Gly Gly Arg Gly Met Gly 

535 540 545 

Ser Arg Leu Ser Asn Gly Glu Val Met Arg Gly He Phe He Lys His 

550 555 560 

Val Leu Glu Asp Ser Pro Ala Gly Lys Asn Gly Thr Leu Lys Pro Gly 

565 570 575 

Asp Arg He Val Glu Ala Pro Ser Gin Ser Glu Ser Glu Fro Glu Lys 

580 585 590 

Ala Pro Leu Cys Ser Val Pro Pro Pro Pro Pro Ser Ala Phe Ala Glu 
595 600 605 610 

Met Gly Ser Asp His Thr Gin Ser Ser Ala Ser Lys He Ser Gin Asp 

615 620 625 

Val Asp Lys Glu Asp Glu Phe Gly Tyr Ser Trp Lys Asn He Arg Glu 

630 635 640 

Arg Tyr Gly Thr Leu Thr Gly Glu Leu His Met He Glu Leu Glu Lys 
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645 650 655 

Gly His Ser Gly Leu Gly Leu Ser Leu Ala Gly Asn Lys Asp Arg Ser 

660 665 670 

Arg Met Ser Val Phe He Val Gly He Asp Pro Asn Gly Ala Ala Gly 
675 680 685 690 

Lys Asp Gly Arg Leu Gin He Ala Asp Glu Leu Leu Glu He Asn Gly 

695 700 705 

Gin lie Leu Tyr Gly Arg Ser His Gin Asn Ala Ser Ser He He Lys 

710 715 720 

Cys Ala Pro Ser Lys Val Lys He He Phe He Arg Asn Lys Asp Ala 

725 730 735 

Val Asn Gin Met Ala Val Cys Pro Gly Asn Ala Val Glu Pro Leu Pro 

740 745 750 

Ser Asn Ser Glu Asn Leu Gin Asn Lys Glu Thr Glu Pro Thr Val Thr 
755 760 765 770 

Thr Ser Asp Ala Ala Val Asp Leu Ser Ser Phe Lys Asn Val Gin His 

775 780 785 

Leu Glu Leu Pro Lys Asp Gin Gly Gly Leu Gly He Ala He Ser Glu 

790 795 800 

Glu Asp Thr Leu Ser Gly Val lie He Lys Ser Leu Thr Glu His Gly 

805 810 815 

Val Ala Ala Thr Asp Gly Arg Leu Lys Val Gly Asp Gin He Leu Ala 

820 825 830 

Val Asp Asp Glu He Val Val Gly Tyr Pro He Glu Lys Phe He Ser 
835 840 845 850 

Leu Leu Lys Thr Ala Lys Met Thr Val Lys Leu Thr He His Ala Glu 
855 860 865 
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Asn Pro Asp Ser Gin Ala Val Pro Ser Ala Ala Gly Ala Ala Ser Gly 

870 875 880 

Glu Lys Lys Asn Ser Ser Gin Ser Leu Met Val Pro Gin Ser Gly Ser 

885 890 895 

Pro Glu Pro Glu Ser He Arg Asn Thr Ser Arg Ser Ser Thr Pro Ala 

900 905 910 

lie Phe Ala Ser Asp Pro Ala Thr Cys Pro He He Pro Gly Cys Glu 
915 920 925 930 

Thr Thr He Glu He Ser Lys Gly Arg Thr Gly Leu Gly Leu Ser He 

935 940 945 

Val Gly Gly Ser Asp Thr Leu Leu Gly Ala Phe He He His Glu Val 

950 955 960 

Tyr Glu Glu Gly Ala Ala Cys Lys Asp Gly Arg Leu Trp Ala Gly Asp 

965 970 975 

Gin He Leu Glu Val Asn Gly He Asp Leu Arg Lys Ala Thr His Asp 

980 985 990 

Glu Ala He Asn Val Leu Arg Gin Thr Pro Gin Arg Val Arg Leu Thr 
995 1000 1005 1010 

Leu Tyr Arg Asp Glu Ala Pro Tyr Lys Glu Glu Glu Val Cys Asp Thr 

1015 1020 1025 

Leu Thr He Glu Leu Gin Lys Lys Pro Gly Lys Gly Leu Gly Leu Ser 

1030 1035 1040 

He Val Gly Lys Arg Asn Asp Thr Gly Val Phe Val Ser Asp He Val 

1045 1050 1055 

Lys Gly Gly He Ala Asp Pro Asp Gly Arg Leu He Gin Gly Asp Gin 

1060 1065 1070 

lie Leu Leu Val Asn Gly Glu Asp Val Arg Asn Ala Ser Gin Glu Ala 
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1075 1080 1085 1090 

Val Ala Ala Leu Leu Lys Cys Ser Leu Gly Thr Val Thr Leu Glu Val 

1095 1100 1105 

Gly Arg He Lys Ala Gly Pro Phe His Ser Glu Arg Arg Pro Ser Gin 

1110 1115 1120 

Thr Ser Gin Val Ser Glu Gly Ser Leu Ser Ser Phe Thr Phe Pro Leu 

1125 1130 1135 

Ser Gly Ser Ser Thr Ser Glu Ser Leu Glu Ser Ser Ser Lys Lys Asn 

1140 1145 1150 

Ala Leu Ala Ser Glu He Gin Gly Leu Arg Thr Val Glu Met Lys Lys 
1155 1160 1165 1170 

Gly Pro Thr Asp Ser Leu Gly He Ser He Ala Gly Gly Val Gly Ser 

1175 1180 1185 

Pro Leu Gly Asp Val Pro He Phe He Ala Met Met His Pro Thr Gly 

1190 1195 1200 

Val Ala Ala Gin Thr Gin Lys Leu Arg Val Gly Asp Arg He Val Thr 

1205 1210 1215 

He Cys Gly Thr Ser Thr Glu Gly Met Thr His Thr Gin Ala Val Asn 

1220 1225 1230 

Leu Leu Lys Asn Ala Ser Gly Ser He Glu Met Gin Val Val Ala Gly 
1235 1240 1245 1250 

Gly Asp Val Ser Val Val Thr Gly His His Gin Glu Pro Ala Ser Ser 

1255 1260 1265 

Ser Leu Ser Phe Thr Gly Leu Thr Ser Thr Ser He Phe Gin Asp Asp 

1270 1275 1280 

Leu Gly Pro Pro Gin Cys Lys Ser He Thr Leu Glu Arg Gly Pro Asp 
1285 1290 1295 



8/166 



61 y Leu Gly Phe Ser lie Val Gly 

1300 1305 
Leu Pro lie Tyr Val Lys Thr Val 
1315 1320 
Asp Gly Arg Leu Lys Arg Gly Asp 
1335 

Ser Leu Glu Gly Val Thr His Glu 
1350 

Thr Lys Gly Thr Val Thr Leu Met 
1365 1370 



Gly Tyr Gly Ser Pro His Gly Asp 
1310 

Phe Ala Lys Gly Ala Ala Ser Glu 
1325 1330 
Gin lie He Ala Val Asn Gly Gin 

1340 1345 
Glu Ala Val Ala ile Leu Lys Arg 
1355 1360 
Val Leu Ser 



<210> 2 
<211> 1005 
<212> PRT 

<213> Homo sapience 



<400> 2 

Met Leu Gin Asn Val Ser Lys Glu Ser Phe Glu Arg Thr Ile Asn Ile 

1 5 10 15 

Ala Lys Gly Asn Ser Ser Leu Gly Met Thr Val Ser Ala Asn Lys Asp 

20 25 30 

Gly Leu Gly Met Ile Val Arg Ser Ile Ile His Gly Gly Ala Ile Ser 

35 40 45 

Arg Asp Gly Arg Ile Ala Ile Gly Asp Cys Ile Leu Ser Ile Asn Glu 

50 55 60 

Glu Ser Thr lie Ser Val Thr Asn Ala Gin Ala Arg Ala Met Leu Arg 
65 70 75 80 
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Arg His Ser Leu He Gly Pro Asp He Lys He Thr Tyr Val Pro Ala 

85 90 95 

Glu His Leu Glu Glu Phe Lys He Ser Leu Gly Gin Gin Ser Gly Arg 

100 105 no 

Val Met Ala Leu Asp He Phe Ser Ser Tyr Thr Gly Arg Asp lie Pro 

115 120 125 

Glu Leu Pro Glu Arg Glu Glu Gly Glu Gly Glu Glu Ser Glu Leu Gin 

130 135 140 

Asn Thr Ala Tyr Ser Asn Trp Asn Gin Pro Arg Arg Val Glu Leu Trp 
I 45 150 155 160 

Arg Glu Pro Ser Lys Ser Leu Gly He Ser He Val Gly Gly Arg Gly 

165 170 175 

Met Gly Ser Arg Leu Ser Asn Gly Glu Val Met Arg Gly He Phe He 

180 185 190 

Lys His Val Leu Glu Asp Ser Pro Ala Gly Lys Asn Gly Thr Leu Lys 

195 200 205 

Pro Gly Asp Arg He Val Glu Ala Pro Ser Gin Ser Glu Ser Glu Pro 

210 215 220 

Glu Lys Ala Pro Leu Cys Ser Val Pro Pro Pro Pro Pro Ser Ala Phe 
225 230 235 240 

Ala Glu Met Gly Ser Asp His Thr Gin Ser Ser Ala Ser Lys He Ser 

245 250 255 

Gin Asp Val Asp Lys Glu Asp Glu Phe Gly Tyr Ser Trp Lys Asn He 

260 265 270 

Arg Glu Arg Tyr Gly Thr Leu Thr Gly Glu Leu His Met He Glu Leu 

275 280 285 

Glu Lys Gly His Ser Gly Leu Gly Leu Ser Leu Ala Gly Asn Lys Asp 



10/166 



290 295 300 

Arg Ser Arg Met Ser Val Phe lie Val Gly He Asp Pro Asn Gly Ala 
305 310 315 320 

Ala Gly Lys Asp Gly Arg Leu Gin He Ala Asp Glu Leu Leu Glu He 

325 330 335 

Asn Gly Gin He Leu Tyr Gly Arg Ser His Gin Asn Ala Ser Ser He 

340 345 350 

He Lys Cys Ala Pro Ser Lys Val Lys He He Phe He Arg Asn Lys 

355 360 365 

Asp Ala Val Asn Gin Met Ala Val Cys Pro Gly Asn Ala Val Glu Pro 

370 375 380 

Leu Pro Ser Asn Ser Glu Asn Leu Gin Asn Lys Glu Thr Glu Pro Thr 
385 390 395 400 

Val Thr Thr Ser Asp Ala Ala Val Asp Leu Ser Ser Phe Lys Asn Val 

405 410 415 

Gin His Leu Glu Leu Pro Lys Asp Gin Gly Gly Leu Gly He Ala He 

420 425 430 

Ser Glu Glu Asp Thr Leu Ser Gly Val He He Lys Ser Leu Thr Glu 

435 440 445 

His Gly Val Ala Ala Thr Asp Gly Arg Leu Lys Val Gly Asp Gin He 

450 455 460 

Leu Ala Val Asp Asp Glu He Val Val Gly Tyr Pro He Glu Lys Phe 
465 470 475 480 

lie Ser Leu Leu Lys Thr Ala Lys Met Thr Val Lys Leu Thr He His 

485 490 495 

Ala Glu Asn Pro Asp Ser Gin Ala Val Pro Ser Ala Ala Gly Ala Ala 
500 505 510 
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Ser Gly Glu Lys Lys Asn Ser Ser Gin Ser Leu Met Val Pro Gin Ser 

515 520 525 

Gly Ser Pro Glu Pro Glu Ser He Arg Asn Thr Ser Arg Ser Ser Thr 

530 535 540 

Pro Ala He Phe Ala Ser Asp Pro Ala Thr Cys Pro lie lie Pro Gly 
545 550 555 560 

Cys Glu Thr Thr He Glu He Ser Lys Gly Arg Thr Gly Leu Gly Leu 

565 570 575 

Ser He Val Gly Gly Ser Asp Thr Leu Leu Gly Ala Phe He He His 

580 585 590 

Glu Val Tyr Glu Glu Gly Ala Ala Cys Lys Asp Gly Arg Leu Trp Ala 

595 600 605 

Gly Asp Gin He Leu Glu Val Asn Gly He Asp Leu Arg Lys Ala Thr 

610 615 620 

His Asp Glu Ala He Asn Val Leu Arg Gin Thr Pro Gin Arg Val Arg 
625 630 635 640 

Leu Thr Leu Tyr Arg Asp Glu Ala Pro Tyr Lys Glu Glu Glu Val Cys 

64 5 650 655 

Asp Thr Leu Thr lie Glu Leu Gin Lys Lys Pro Gly Lys Gly Leu Gly 

660 665 670 

Leu Ser He Val Gly Lys Arg Asn Asp Thr Gly Val Phe Val Ser Asp 

675 680 685 

He Val Lys Gly Gly He Ala Asp Pro Asp Gly Arg Leu He Gin Gly 

690 695 700 

Asp Gin He Leu Leu Val Asn Gly Glu Asp Val Arg Asn Ala Ser Gin 
705 710 715 720 

Glu Ala Val Ala Ala Leu Leu Lys Cys Ser Leu Gly Thr Val Thr Leu 
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725 730 735 

Glu Val Gly Arg lie Lys Ala Gly Pro Phe His Ser Glu Arg Arg Pro 

740 745 750 

Ser Gin Thr Ser Gin Val Ser Glu Gly Ser Leu Ser Ser Phe Thr Phe 

755 760 765 

Pro Leu Ser Gly Ser Ser Thr Ser Glu Ser Leu Glu Ser Ser Ser Lys 

770 775 780 

Lys Asn Ala Leu Ala Ser Glu He Gin Gly Leu Arg Thr Val Glu Met 
785 790 795 800 

Lys Lys Gly Pro Thr Asp Ser Leu Gly He Ser He Ala Gly Gly Val 

805 810 815 

Gly Ser Pro Leu Gly Asp Val Pro lie Phe He Ala Met Met His Pro 

820 825 830 

Thr Gly Val Ala Ala Gin Thr Gin Lys Leu Arg Val Gly Asp Arg He 

835 840 845 

Val Thr He Cys Gly Thr Ser Thr Glu Gly Met Thr His Thr Gin Ala 

850 855 860 

Val Asn Leu Leu Lys Asn Ala Ser Gly Ser He Glu Met Gin Val Val 
865 870 875 880 

Ala Gly Gly Asp Val Ser Val Val Thr Gly His His Gin Glu Pro Ala 

885 890 895 

Ser Ser Ser Leu Ser Phe Thr Gly Leu Thr Ser Thr Ser lie Phe Gin 

900 905 910 

Asp Asp Leu Gly Pro Pro Gin Cys Lys Ser He Thr Leu Glu Arg Gly 

915 920 925 

Pro Asp Gly Leu Gly Phe Ser He Val Gly Gly Tyr Gly Ser Pro His 
930 935 940 
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Gly Asp Leu Pro He Tyr Val Lys Thr Val Phe Ala Lys Gly Ala Ala 
945 950 955 960 

Ser Glu Asp Gly Arg Leu Lys Arg Gly Asp Gin He lie Ala Val Asn 

965 970 975 

Gly Gin Ser Leu Glu Gly Val Thr His Glu Glu Ala Val Ala He Leu 

980 985 990 

Lys Arg Thr Lys Gly Thr Val Thr Leu Met Val Leu Ser 
995 1000 1005 

<210> 3 
<211> 4880 
<212> DNA 

<213> Homo sapience 

<220> 
<221> CDS 

<222> (292). ..(4410) 
<400> 3 

cccgggcccg ggcgacagtg ggacatcatt ttatccgatc tgttctacca gagggtcctg 60 
ttggacacag cgggaagctc ttcagtggag acgagctatt ggaaaataag taacgcattc 120 
agatgtttaa aatcacagag aatacaaaga taaagaatgg aaaagggtct ccttcctgtc 180 
ccaattcatc cagttctcat cacccttcat taggtaaatg gcataacttt acttggggaa 240 
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aatcaccaag atgtggtgaa tatcttaaaa gaactgccta tagaagtgac a atg gtg 297 

Met Val 

1 

tgc tgt cgt cga act gtg cca ccc acc acc caa tea gaa ttg gat age 345 
Cys Cys Arg Arg Thr Val Pro Pro Thr Thr Gin Ser Glu Leu Asp Ser 
5 10 15 

ctg gac tta tgt gat att gag eta aca gaa aag cct cac gta gat eta 393 
Leu Asp Leu Cys Asp He Glu Leu Thr Glu Lys Pro His Val Asp Leu 
20 25 30 

ggt gag ttc ate ggg tea tea gag aca gag gat cca gtg ctg gcg atg 441 
Gly Glu Phe He Gly Ser Ser Glu Thr Glu Asp Pro Val Leu Ala Met 
35 40 45 50 

act gat gcg ggt cag agt aca gaa gag gtt caa gca cct ttg gec atg 489 
Thr Asp Ala Gly Gin Ser Thr Glu Glu Val Gin Ala Pro Leu Ala Met 
55 60 . 65 

tgg gag get ggc att cag cac ata gag ctg gag aaa ggg age aaa gga 537 
Trp Glu Ala Gly He Gin His He Glu Leu Glu Lys Gly Ser Lys Gly 
70 75 80 



ctt ggt ttt age att tta gat tat cag gat cca att gat cca gca age 
Leu Gly Phe Ser He Leu Asp Tyr Gin Asp Pro lie Asp Pro Ala Ser 
85 90 95 



585 
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act gtg att ata att cgt tct ttg gtg cct ggc ggc att get gaa aag 633 
Thr Val He He lie Arg Ser Leu Val Pro Gly Gly He Ala Glu Lys 
100 105 110 

gat gga cga ctt ctt cct ggt gac cga etc atg ttt gta aac gat gtt 681 
Asp Gly Arg Leu Leu Pro Gly Asp Arg Leu Met Phe Val Asn Asp Val 
US 120 125 130 

aac ttg gaa aac age agt ctt gag gaa get gta gaa gca ctg aag gga 729 
Asn Leu Glu Asn Ser Ser Leu Glu Glu Ala Val Glu Ala Leu Lys Gly 
135 140 145 

gca ccg tea ggg act gtg aga ata gga gtt get aag cct tta ccc ctt 777 
Ala Pro Ser Gly Thr Val Arg He Gly Val Ala Lys Pro Leu Pro Leu 
150 155 160 

tea cca gaa gaa ggt tat gtt tct get aag gag gat tec ttt etc tac 825 
Ser Pro Glu Glu Gly Tyr Val Ser Ala Lys Glu Asp Ser Phe Leu Tyr 
165 170 175 

cca cca cac tec tgt gag gaa gca ggg ctg get gac aaa ccc etc ttc 873 
Pro Pro His Ser Cys Glu Glu Ala Gly Leu Ala Asp Lys Pro Leu Phe 
180 185 190 

agg get gac ttg get ctg gtg ggc aca aat gat get gac tta gta gat 921 
Arg Ala Asp Leu Ala Leu Val Gly Thr Asn Asp Ala Asp Leu Val Asp 



i 
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195 200 205 210 

gaa tec aca ttt gag tct cca tac tct cct gaa aat gac age ate tac 969 
Glu Ser Thr Phe Glu Ser Pro Tyr Ser Pro Glu Asn Asp Ser He Tyr 
215 220 225 

tct act caa gec tct att tta tct ctt cat ggc agt tct tgt ggt gat 1017 
Ser Thr Gin Ala Ser He Leu Ser Leu His Gly Ser Ser Cys Gly Asp 
230 235 240 

ggc ctg aac tat ggt tct tec ctt cca tea tct cct cct aag gat gtt 1065 
Gly Leu Asn Tyr Gly Ser Ser Leu Pro Ser Ser Pro Pro Lys Asp Val 
245 250 255 

att gaa aat tct tgt gat cca gta ctt gat ctg cat atg tct ctg gag 1113 
He Glu Asn Ser Cys Asp Pro Val Leu Asp Leu His Met Ser Leu Glu 
260 265 270 

gaa eta tat acc cag aat etc ctg gaa aga cag gat gag aat aca cct 1161 
Glu Leu Tyr Thr Gin Asn Leu Leu Glu Arg Gin Asp" Glu Asn Thr Pro 
275 280 285 290 

teg gtg gac ata agt atg ggg cct get tct ggc ttt act ata aat gac 1209 
Ser Val Asp He Ser Met Gly Pro Ala Ser Gly Phe Thr He Asn Asp 
295 300 305 

tac aca cct gca aat get att gaa caa caa tat gaa tgt gaa aac aca 1257 
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Tyr Thr Pro Ala Asn Ala He Glu Gin Gin Tyr Glu Cys Glu Asn Thr 
310 315 320 

ata gtg tgg act gaa tct cat tta cca agt gaa gtt ata tea agt gca 1305 
He Val Trp Thr Glu Ser His Leu Pro Ser Glu Val He Ser Ser Ala 
325 330 335 

gaa ctt cct tct gtg eta ccc gat tea get gga aag ggc tct gag cac 1353 
Glu Leu Pro Ser Val Leu Pro Asp Ser Ala Gly Lys Gly Ser Glu His 
340 345 350 

ctg ctt gaa cag age tec ctg gee tgt aat get gag tgt gtc atg ctt 1401 
Leu Leu Glu Gin Ser Ser Leu Ala Cys Asn Ala Glu Cys Val Met Leu 
355 360 365 370 

caa aat gta tct aaa gaa tct ttt gaa agg act att aat ata gca aaa 1449 
Gin Asn Val Ser Lys Glu Ser Phe Glu Arg Thr lie Asn He Ala Lys 
375 380 385 

ggc aat tct age eta gga atg aca gtt agt get aat aaa gat ggc ttg 1497 
Gly Asn Ser Ser Leu Gly Met Thr Val Ser Ala Asn Lys Asp Gly Leu 
390 395 400 

ggg atg ate gtt cga age att att cat gga ggt gee att agt cga gat 1545 
Gly Met lie Val Arg Ser He He His Gly Gly Ala He Ser Arg Asp 
405 410 415 
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ggc egg att gec att ggg gac tgc ate ttg tec att aat gaa gag tct 1593 
Gly Arg He Ala He Gly Asp Cys He Leu Ser He Asn Glu Glu Ser 
420 425 430 

acc ate agt gta ace aat gee cag gca cga get atg ttg aga aga cat 1641 
Thr He Ser Val Thr Asn Ala Gin Ala Arg Ala Met Leu Arg Arg His 
435 440 445 450 

tct etc att ggc cct gac ata aaa att act tat gtg cct gca gaa cat 1689 
Ser Leu He Gly Pro Asp He Lys He Thr Tyr Val Pro Ala Glu His 
455 460 465 

ttg gaa gag ttc aaa ata age ttg gga caa caa tct gga aga gta atg 1737 
Leu Glu Glu Phe Lys He Ser Leu Gly Gin Gin Ser Gly Arg Val Met 
470 475 480 

gca ctg gat att ttt tct tea tac act ggc aga gac att cca gaa tta 1785 
Ala Leu Asp He Phe Ser Ser Tyr Thr Gly Arg Asp He Pro Glu Leu 
485 490 495 

cca gag cga gaa gag gga gag ggt gaa gaa age gaa ctt caa aac aca 1833 
Pro Glu Arg Glu Glu Gly Glu Gly Glu Glu Ser Glu Leu Gin Asn Thr 
500 505 510 

gca tat age aat tgg aat cag ccc agg egg gtg gaa etc tgg aga gaa 1881 
Ala Tyr Ser Asn Trp Asn Gin Pro Arg Arg Val Glu Leu Trp Arg Glu 
515 520 525 530 
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cca age aaa tec tta ggc ate age att gtt ggt gga cga ggg atg ggg 1929 
Pro Ser Lys Ser Leu Gly He Ser He Val Gly Gly Arg Gly Met Gly 
535 540 545 

agt egg eta age aat gga gaa gtg atg agg ggc att ttc ate aaa cat 1977 

Ser Arg Leu Ser Asn Gly Glu Val Met Arg Gly He Phe He Lys His 
550 555 560 

gtt ctg gaa gat agt cca get ggc aaa aat gga acc ttg aaa cct gga 2025 

Val Leu Glu Asp Ser Pro Ala Gly Lys Asn Gly Thr Leu Lys Pro Gly 

565 570 575 

gat aga ate gta gag gca ccc agt cag tea gag tea gag cca gag aag 2073 

Asp Arg He Val Glu Ala Pro Ser Gin Ser Glu Ser Glu Pro Glu Lys 
580 585 590 

get cca ttg tgc agt gtg ccc cca ccc cct cct tea gec ttt gee gaa 2121 

Ala Pro Leu Cys Ser Val Pro Pro Pro Pro Pro Ser Ala Phe Ala Glu 
595 600 605 610 

atg ggt agt gat cac aca cag tea tct gca age aaa ate tea caa gat 2169 

Met Gly Ser Asp His Thr Gin Ser Ser Ala Ser Lys He Ser Gin Asp 
615 620 625 

gtg gac aaa gag gat gag ttt ggt tac age tgg aaa aat ate aga gag 2217 

Val Asp Lys Glu Asp Glu Phe Gly Tyr Ser Trp Lys Asn He Arg Glu 
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630 635 640 

cgt tat gga acc eta aca ggc gag ctg cat atg att gaa ctg gag aaa 2265 
Arg Tyr Gly Thr Leu Thr Gly Glu Leu His Met He Glu Leu Glu Lys 
645 650 655 

ggt cat agt ggt ttg ggc eta agt ctt get ggg aac aaa gac cga tec 2313 
Gly His Ser Gly Leu Gly Leu Ser Leu Ala Gly Asn Lys Asp Arg Ser 
660 665 670 

agg atg agt gtc ttc ata gtg ggg att gat cca aat gga get gca gga 2361 
Arg Met Ser Val Phe He Val Gly He Asp Pro Asn Gly Ala Ala Gly 
675 680 685 690 

aaa gat ggt cga ttg caa att gca gat gag ctt eta gag ate aat ggt 2409 
Lys Asp Gly Arg Leu Gin He Ala Asp Glu Leu Leu Glu lie Asn Gly 
695 700 705 

cag att tta tat gga aga agt cat cag aat gec tea tea ate att aaa 2457 
Gin lie Leu Tyr Gly Arg Ser His Gin Asn Ala Ser Ser He He Lys 
710 715 720 

tgt gec cct tct aaa gtg aaa ata att ttt ate aga aat aaa gat gca 2505 
Cys Ala Pro Ser Lys Val Lys He He Phe He Arg Asn Lys Asp Ala 
725 730 735 

gtg aat cag atg gee gta tgt cct gga aat gca gta gaa cct ttg cct 2553 
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Val Asn Gin Met Ala Val 
740 

tct aac tea gaa aat ctt 
Ser Asn Ser Glu Asn Leu 
755 760 

act tct gat gca get gtg 
Thr Ser Asp Ala Ala Val 
775 

ctg gag ctt ccc aag gat 
Leu Glu Leu Pro Lys Asp 
790 

gaa gat aca etc agt gga 
Glu Asp Thr Leu Ser Gly 
805 

gta gca gec acg gat gga 
Val Ala Ala Thr Asp Gly 
820 

gta gat gat gaa att gtt 
Val Asp Asp Glu He Val 
835 840 



Cys Pro Gly Asn Ala Val 
745 750 

caa aat aag gag aca gag 
Gin Asn Lys Glu Thr Glu 
765 

gac etc agt tea ttt aaa 
Asp Leu Ser Ser Phe Lys 
780 

cag ggg ggt ttg ggt att 
Gin Gly Gly Leu Gly He 
795 

gtc ate ata aag age tta 
Val He He Lys Ser Leu 
810 

cga etc aaa gtc gga gat 
Arg Leu Lys Val Gly Asp 
825 830 

gtt ggt tac cct att gaa 
Val Gly Tyr Pro He Glu 
845 



Glu Pro Leu Pro 

cca act gtt act 2601 
Pro Thr Val Thr 
770 

aat gtg caa cat 2649 
Asn Val Gin His 
785 

get ate age gaa 2697 
Ala lie Ser Glu 

- 800 

aca gag cat ggg 2745 
Thr Glu His Gly 
815 

cag ata ctg get 2793 
Gin He Leu Ala 

aag ttt att age 2841 
Lys Phe He Ser 
850 
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ctt ctg aag aca gca aag atg aca gta aaa ctt acc ate cat get gag 2889 
Leu Leu Lys Thr Ala Lys Met Thr Val Lys Leu Thr lie His Ala Glu 
855 860 865 

aat cca gat tec cag get gtt cct tea gca get ggt gca gec agt gga 2937 
Asn Pro Asp Ser Gin Ala Val Pro Ser Ala Ala Gly Ala Ala Ser Gly 
870 875 880 

gaa aaa aag aac age tec cag tct ctg atg gtc cca cag tct ggc tec 2985 
Glu Lys Lys Asn Ser Ser Gin Ser Leu Met Val Pro Gin Ser Gly Ser 
885 890 895 

cca gaa ccg gag tec ate cga aat aca age aga tea tea aca cca gca 3033 
Pro Glu Pro Glu Ser He Arg Asn Thr Ser Arg Ser Ser Thr Pro Ala 
900 905 910 

att ttt get tct gat cct gca acc tgc ccc att ate cct ggc tgc gaa 3081 
He Phe Ala Ser Asp Pro Ala Thr Cys Pro He He Pro Gly Cys Glu 
915 920 925 930 

aca acc ate gag att tec aaa ggg cga aca ggg ctg ggc ctg age ate 3129 
Thr Thr He Glu lie Ser Lys Gly Arg Thr Gly Leu Gly Leu Ser He 
935 940 945 



gtt ggg ggt tea gac acg ctg ctg ggt gee ttt att ate cat gaa gtt 
Val Gly Gly Ser Asp Thr Leu Leu Gly Ala Phe He He His Glu Val 
950 955 960 



3177 
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tat gaa gaa gga gca gca tgt aaa gat gga aga etc tgg get gga gat 3225 
Tyr Glu Glu Gly Ala Ala Cys Lys Asp Gly Arg Leu Trp Ala Gly Asp 
965 970 975 

cag ate tta gag gtg aat gga att gac ttg agg aag gec aca cat gat 3273 
Gin lie Leu Glu Val Asn Gly He Asp Leu Arg Lys Ala Thr His Asp 
980 985 990 

gaa gca ate aat gtc ctg aga cag acg cca cag aga gtg cgc ctg aca 3321 
Glu Ala lie Asn Val Leu Arg Gin Thr Pro Gin Arg Val Arg Leu Thr 
995 1000 1005 1010 

etc tac aga gat gag gee cca tac aaa gag gag gaa gtg tgt gac acc 3369 
Leu Tyr Arg Asp Glu Ala Pro Tyr Lys Glu Glu Glu Val Cys Asp Thr 
1015 1020 1025 

etc act att gag ctg cag aag aag ccg gga aaa ggc eta gga tta agt 3417 
Leu Thr He Glu Leu Gin Lys Lys Pro Gly Lys Gly Leu Gly Leu Ser 
1030 1035 1040 

att gtt ggt aaa aga aac gat act gga gta ttt gtg tea gac att gtc 3465 
He Val Gly Lys Arg Asn Asp Thr Gly Val Phe Val Ser Asp He Val 
1045 1050 1055 

aaa gga gga att gca gat ccc gat gga aga ctg ate cag gga gac cag 3513 
Lys Gly Gly He Ala Asp Pro Asp Gly Arg Leu He Gin Gly Asp Gin 
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1060 1065 1070 

ata tta ttg gtg aat ggg gaa gac gtt cgt aat gcc tec caa gaa gcg 3561 
He Leu Leu Val Asn Gly Glu Asp Val Arg Asn Ala Ser Gin Glu Ala 
!075 1080 1085 1090 

gtt gcc get ttg eta aag tgt tec eta ggc aca gta ace ttg gaa gtt 3609 
Val Ala Ala Leu Leu Lys Cys Ser Leu Gly Thr Val Thr Leu Glu Val 
1095 1100 1105 

gga aga ate aaa get ggt cca ttc cat tea gag agg agg cca tct caa 3657 
Gly Arg He Lys Ala Gly Pro Phe His Ser Glu Arg Arg Pro Ser Gin 
1110 1115 1120 

acc age cag gtg agt gaa ggc age ctg tct tct ttc act ttt cca etc 3705 
Thr Ser Gin Val Ser Glu Gly Ser Leu Ser Ser Phe Thr Phe Pro Leu 
1125 1130 1135 

tct gga tec agt aca tct gag tea ctg gaa agt age tea aag aag aat 3753 
Ser Gly Ser Ser Thr Ser Glu Ser Leu Glu Ser Ser Ser Lys Lys Asn 
1140 1145 1150 

gca ttg gca tct gaa ata cag gga tta aga aca gtc gaa atg aaa aag 3801 
Ala Leu Ala Ser Glu He Gin Gly Leu Arg Thr Val Glu Met Lys Lys 
1155 1160 1165 1170 

ggc cct act gac tea ctg gga ate age ate get gga gga gta ggc age 3849 
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Gly Pro Thr Asp Ser Leu Gly He Ser He Ala Gly Gly Val Gly Ser 
1175 1180 1185 



cca ctt ggt gat gtg cct ata ttt att gca atg atg cac cca act gga 
Pro Leu Gly Asp Val Pro He Phe He Ala Met Met His Pro Thr Gly 
1190 1195 1200 



3897 



gtt gca gca cag acc caa aaa etc aga gtt ggg gat agg att gtc acc 
Val Ala Ala Gin Thr Gin Lys Leu Arg Val Gly Asp Arg He Val Thr 
1205 1210 1215 



3945 



ate tgt ggc aca tec act gag ggc atg act cac acc caa gca gtt aac 
He Cys Gly Thr Ser Thr Glu Gly Met Thr His Thr Gin Ala Val Asn 
1220 1225 1230 



3993 



eta ctg aaa aat gca tct ggc tec att gaa atg cag gtg gtt get gga 
Leu Leu Lys Asn Ala Ser Gly Ser He Glu Met Gin Val Val Ala Gly 
1235 1240 1245 1250 



4041 



gga gac gtg agt gtg gtc aca ggt cat cat cag gag cct gca agt tec 
Gly Asp Val Ser Val Val Thr Gly His His Gin Glu Pro Ala Ser Ser 
1255 1260 1265 



4089 



agt ctt tct ttc act ggg ctg acg tea acc agt ata ttt cag gat gat 
Ser Leu Ser Phe Thr Gly Leu Thr Ser Thr Ser lie Phe Gin Asp Asp 
1270 1275 1280 



4137 
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tta gga cct cct caa tgt aag tct att aca eta gag cga gga cca gat 4185 
Leu Gly Pro Pro Gin Cys Lys Ser lie Thr Leu Glu Arg Gly Pro Asp 
1285 1290 1295 

ggc tta ggc ttc agt ata gtt gga gga tat ggc age cct cat gga gac 4233 
Gly Leu Gly Phe Ser lie Val Gly Gly Tyr Gly Ser Pro His Gly Asp 
1300 1305 1310 

tta ccc att tat gtt aaa aca gtg ttt gca aag gga gca gec tct gaa 4281 
Leu Pro He Tyr Val Lys Thr Val Phe Ala Lys Gly Ala Ala Ser Glu 
1315 1320 1325 1330 

gac gga cgt ctg aaa agg ggc gat cag ate att get gtc aat ggg cag 4329 
Asp Gly Arg Leu Lys Arg Gly Asp Gin He He Ala Val Asn Gly Gin 
1335 1340 1345 

agt eta gaa gga gtc ace cat gaa gaa get gtt gec ate ctt aaa egg 4377 
Ser Leu Glu Gly Val Thr His Glu Glu Ala Val Ala He Leu Lys Arg 
1350 1355 1.360 

aca aaa ggc act gtc act ttg atg gtt etc tct tgaattggct gecagaattg 4430 
Thr Lys Gly Thr Val Thr Leu Met Val Leu Ser 
1365 1370 

aaccaaccca acccctagct cacctcctac tgtaaagaga atgcactggt cctgacaatt 4490 



tttatgctgt gttcagcegg gtcttcaaaa ctgtaggggg gaaataacac ttaagtttct 4550 
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ttttctcatc tagaaatgct ttccttactg acaacctaac atcatttttc ttttcttctt 4610 

gcattttgtg aacttaaaga gaaggaatat ttgtgtaggt gaatctcgtt tttatttgtg 4670 

gagatatcta atgttttgta gtcacatggg caagaattat tacatgctaa gctggttagt 4730 

ataaagaaag ataattctaa agctaaccaa agaaaatggc ttcagtaagt taggatgaaa 4790 

aatgaaaata taaaataaag aagaaaatct cggggagttt aaaaaaaatg cctcaatttg 4850 

gcaatctacc tcctctcccc accccaaact 

<210> 4 
<211> 90 
<212> PET 

< 2 1 3 > Homo sapience 
<400> 4 

Ala Gly He Gin His He Glu Leu Glu Lys Gly Ser Lys Gly Leu Gly 

1 5 io is 

Phe Ser He Leu Asp Tyr Gin Asp Pro He Asp Pro Ala Ser Thr Val 

20 25 30 

He He He Arg Ser Leu Val Pro Gly Gly He Ala Glu Lys Asp Gly 

35 40 45 

Arg Leu Leu Pro Gly Asp Arg Leu Met Phe Val Asn Asp Val Asn Leu 
50 55 60 
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Glu Asn Ser Ser Leu Glu Glu Ala Val Glu Ala Leu Lys Gly Ala Pro 

65 70 75 80 

Ser Gly Thr Val Arg He Gly Val Ala Lys 
85 90 



<210> 5 

<211> 91 

<212> PUT 

<213> Homo sapience 



<400> 5 

Gin Asn Val Ser Lys Glu Ser Phe 

1 5 
Gly Asn Ser Ser Leu Gly Met Thr 
20 

Gly Met lie Val Arg Ser He lie 
35 40 
Gly Arg He Ala He Gly Asp Cys 

50 55 
Thr lie Ser Val Thr Asn Ala Gin 

65 70 
Ser Leu He Gly Pro Asp lie Lys 
85 



Glu Arg Thr He Asn lie Ala Lys 

10 15 
Val Ser Ala Asn Lys Asp Gly Leu 
25 30 
His Gly Gly Ala lie Ser Arg Asp 
45 

lie Leu Ser He Asn Glu Glu Ser 
60 

Ala Arg Ala Met Leu Arg Arg His 
75 80 
lie Thr Tyr 
90 



<210> 6 
<211> 96 
<212> PRT 
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< 2 1 3 > Hobo sapience 



<400> 6 

Asn Gin Pro Arg Arg Val Glu Leu 

1 5 
Gly lie Ser He Val Gly Gly Arg 
20 

Gly Glu Val Met Arg Gly He Phe 
35 40 
Pro Ala Gly Lys Asn Gly Thr Leu 

50 55 
Ala Pro Ser Gin Ser Glu Ser Glu 

65 70 
Val Pro Pro Pro Pro Pro Ser Ala 
85 



Trp Arg Glu Pro Ser Lys Ser Leu 

10 15 
Gly Met Gly Ser Arg Leu Ser Asn 
25 30 
He Lys His Val Leu Glu Asp Ser 
45 

Lys Pro Gly Asp Arg He Val Glu 
60 

Pro Glu Lys Ala Pro Leu Cys Ser 
75 80 
Phe Ala Glu Met Gly Ser Asp His 
90 95 



<210> 7 

<211> 86 

<212> PRT 

<213> Hoio sapience 



<400> 7 

Gly Glu Leu His Met He Glu Leu 

1 5 
Leu Ser Leu Ala Gly Asn Lys Asp 
20 

Val Gly He Asp Pro Asn Gly Ala 



Glu Lys Gly His Ser Gly Leu Gly 

10 15 

Arg Ser Arg Met Ser Val Phe He 
25 30 

Ala Gly Lys Asp Gly Arg Leu Gin 
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35 40 45 

lie Ala Asp Glu Leu Leu Glu He Asn Gly Gin lie Leu Tyr Gly Arg 

50 55 60 

Ser His Gin Asn Ala Ser Ser He lie Lys Cys Ala Pro Ser Lys Val 
65 70 75 80 

Lys He lie Phe He Arg 
85 



<210> 8 
<211> 84 
<212> PET 

<213> Homo sapience 



<400> 8 

Lys Asn Val Gin His Leu Glu Leu 

1 5 
He Ala lie Ser Glu Glu Asp Thr 
20 

Leu Thr Glu His Gly Val Ala Ala 
35 40 
Asp Gin He Leu Ala Val Asp Asp 

50 55 
Glu Lys Phe He Ser Leu Leu Lys 

65 70 
Thr He His Ala 



Pro Lys Asp Gin Gly Gly Leu Gly 

10 15 
Leu Ser Gly Val He He Lys Ser 
25 30 
Thr Asp Gly Arg Leu Lys Val Gly 
45 

Glu He Val Val Gly Tyr Pro He 
60 

Thr Ala Lys Met Thr Val Lys Leu 
75 80 



<210> 9 
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< 2 1 1 > 86 
<212> PET 

<213> Homo sapience 
<400> 9 

Gly Cys Glu Thr Thr He Glu He Ser Lys Gly Arg Thr Gly Leu Gly 

1 5 10 15 

Leu Ser He Val Gly Gly Ser Asp Thr Leu Leu Gly Ala Phe He He 

20 25 30 

His Glu Val Tyr Glu Glu Gly Ala Ala Cys Lys Asp Gly Arg Leu Trp 

35 40 45 

Ala Gly Asp Gin He Leu Glu Val Asn Gly He Asp Leu Arg Lys Ala 

50 55 60 

Thr His Asp Glu Ala He Asn Val Leu Arg Gin Thr Pro Gin Arg Val 
65 70 75 80 

Arg Leu Thr Leu Tyr Arg 
85 

<210> 10 
<211> 85 
<212> PRT 

< 2 1 3 > Hobo sapience 
<400> 10 

Cys Asp Thr Leu Thr He Glu Leu Gin Lys Lys Pro Gly Lys Gly Leu 

1 5 10 15 

Gly Leu Ser lie Val Gly Lys Arg Asn Asp Thr Gly Val Phe Val Ser 
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20 25 30 

Asp lie Val Lys Gly Gly lie Ala Asp Pro Asp Gly Arg Leu He Gin 

35 40 45 

Gly Asp Gin lie Leu Leu Val Asn Gly Glu Asp Val Arg Asn Ala Ser 

50 55 60 

Gin Glu Ala Val Ala Ala Leu Leu Lys Cys Ser Leu Gly Thr Val Thr 
65 70 75 80 

Leu Glu Val Gly Arg 
85 



<210> 11 

<211> 89 

<212> PET 

< 2 1 3 > Homo sapience 



<400> 11 

Gin Gly Leu Arg Thr Val Glu Met 

1 5 
Gly He Ser He Ala Gly Gly Val 
20 

He Phe lie Ala Met Met His Pro 
35 40 
Lys Leu Arg Val Gly Asp Arg He 

50 55 
Glu Gly Met Thr His Thr Gin Ala 

65 70 
Gly Ser He Glu Met Gin Val Val 



Lys Lys Gly Pro Thr Asp Ser Leu 

10 15 
Gly Ser Pro Leu Gly Asp Val Pro 
25 30 
Thr Gly Val Ala Ala Gin Thr Gin 
45 

Val Thr lie Cys Gly Thr Ser Thr 
60 

Val Asn Leu Leu Lys Asn Ala Ser 
75 80 

Ala 
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85 

<210> 12 
<211> 88 
<212> PRT 

< 2 1 3 > Homo sapience 
<400> 12 

Pro Gin Cys Lys Ser He Thr Leu Glu Arg Gly Pro Asp Gly Leu Gly 

1 5 10 15 

Phe Ser He Val Gly Gly Tyr Gly Ser Pro His Gly Asp Leu Pro He 

20 25 30 

Tyr Val Lys Thr Val Phe Ala Lys Gly Ala Ala Ser Glu Asp Gly Arg 

35 40 45 

Leu Lys Arg Gly Asp Gin He He Ala Val Asn Gly Gin Ser Leu Glu 

50 55 60 

Gly Val Thr His Glu Glu Ala Val Ala He Leu Lys Arg Thr Lys Gly 
65 70 75 80 

Thr Val Thr Leu Met Val Leu Ser 
85 

<210> 13 
<211> 184 
<212> DNA 

<213> Homo sapience 
<400> 132 
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gctattttga aaatatattt atatctacga aaagaattgg gaaaacaaat atttaatcag 60 

agaattattc cttaaagatt taaaatgtat ttagttgtac attttatatg ggttcacccc 120 

agcacatgaa gtataatggt cagatttatt tngtatttat ttactattat aaccactttt 180 

tagg 184 

<210> 14 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 14 

ctccccatcc ctcgtccacc 20 

<210> 15 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Artificially Synthesized Primer Sequence 



<400> 15 

ctctgactct gactgactgg 



20 



35/166 



<210> 16 
<211> 20 
<212> DNA 

< 2 1 3 > Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 16 

atgagtttgg ttacagctgg 

<210> 17 
<211> 20 
<212> DNA 

< 2 1 3 > Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 17 

tcagagagcg ttatggaacc 

<210> 18 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 18 

agtcttgctg ggaacaaaga 

<210> 19 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 19 

actgttacta cttctgatgc 

<210> 20 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 20 

tctgatggtc ccacagtctg 
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<210> 21 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 21 

gttgtttcgc agccagggat 

<210> 22 
<211> 20 
<212> DNA 

< 2 1 3 > Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 22 

ctgagcatcg ttgggggttc 

<210> 23 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 23 

cctcatctct gtagagtgtc 

<210> 24 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 24 

tgttagcccc ctcactaagg 

<210> 25 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 25 
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gctatgtgct aggaaatacg 

<210> 26 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

< 223 > Artificially Synthesized Primer Sequence 

<400> 26 

tagggagaag gatcagagcg 

<210> 27 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 27 

acagatttct gactcactgg 



<210> 28 
<211> 20 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 

<400> 28 

tggaaatagg cattcttcag 

<210> 29 

<211> 20 

<212> DNA 

< 2 1 3 > Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 

<400> 29 

atacaaagac ggtctaatcc 

<210> 30 

<211> 21 

<212> DNA 

< 2 1 3 > Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
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<400> 30 

ccgctttccc atctttagaaa c 

<210> 31 
<211> 20 
<212> DNA 

< 2 1 3 > Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 31 

tatctcgtgt ggaagatgtg 

<210> 32 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 32 

acataaatgt tgctatcacc 



<210> 33 
<211> 20 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 33 

tgccacttag tagccgagtg 

<210> 34 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 34 

gcattgcatt acagttgagc 

<210> 35 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
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<400> 35 

tcctcctttg acaatgtctg 

<210> 36 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 

<400> 36 

catttcgact gttcttaatc 

<210> 37 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 37 

tcagtggatg tgccacagat 



<210> 38 
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< 2 1 1 > 20 
<212> DNA 

< 2 1 3 > Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 38 

cagtaggtta actgcttcgg 

<210> 39 
<211> 20 
<212> DNA 

< 2 1 3 > Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 39 

agttccagtc tttctttcgg 

<210> 40 

<211> 21 

<212> DNA 

< 2 1 3 > Artificial Sequence 



<220> 
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<223> Artificially Synthesized Primer Sequence 
<400> 40 

tttctttcac tgggctgaagt c 

<210> 41 
<211> 20 
<212> DNA 

< 2 1 3 > Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 41 

cctctgaaga cggacgtctg 

<210> 42 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 42 

ccatcctaat acgactcact atagggc 
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<210> 43 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 43 

ttggggtggg gagaggaggt agattgc 

<210> 44 

<211> 23 

<212> DNA 

< 2 1 3 > Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 

<400> 44 

actcactata gggctcgagc ggc 

<210> 45 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 45 

gcacatcacc aagtgggctg cctactc 

<210> 46 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 

<400> 46 

atgagtttgg ttacagctgg 

<210> 47 

<211> 20 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Artificially Synthesized Primer Sequence 

<400> 47 

aatctaatgc agctcgcctg 
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<210> 48 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 

<400> 48 



Q 

m 
a 
ru 
m 

m 



agtcttgctg ggaacaaaga 



<212> DNA 



<210> 49 



<211> 20 



20 



ru 

G 

O 



<213> Artificial Sequence 



<220> 



<223> Artificially Synthesized Primer Sequence 



<400> 49 



tcactttaga aggggcacat 



20 



<210> 50 



<211> 20 



<212> DNA 



< 2 1 3 > Artificial Sequence 
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<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 50 

actgttacta cttctgatgc 

<210> 51 
<211> 20 
<212> DNA 

< 2 1 3 > Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 51 

tctgatggtc ccacagtctg 

<210> 52 
<211> 20 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 52 
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gttgtttcgc agccagggat 

<210> 53 
<211> 20 
<212> DNA 

< 2 1 3 > Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 53 

ctgagcatcg ttgggggttc 

<210> 54 
<211> 20 
<212> DNA 

< 2 1 3 > Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 54 

cctcatctct gtagagtgtc 



<210> 55 
<211> 20 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 55 

tagggagaag gatcagagcg 

<210> 56 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 56 

tcctcctttg acaatgtctg 

<210> 57 

<211> 18 

<212> DNA 

< 2 1 3 > Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
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<400> 57 

tttcatcatc tacagccagt 20 

<210> 58 
<211> 20 
<212> DNA 

< 2 1 3 > Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
:S <400> 58 

□ tgacaccctc actattgagc 20 

S <210> 59 

L <211> 2819 

IS <212> DNA 

_^ < 2 1 3 > Homo sapience 

<220> 
<221> CDS 

<222> (43). ..(2331) 
<400> 59 

accaccgcct ccgcggcacc ccctccttca gcctttgccg aa atg ggt agt aat 54 

Met Gly Ser Asn 
1 
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cac aca cag tea tct gca age aaa ate tea caa gat gtg gac aaa gag 102 
His Thr Gin Ser Ser Ala Ser Lys He Ser Gin Asp Val Asp Lys Glu 
5 10 15 20 

gat gag ttt ggt tac age tgg aaa aat ate aga gag cgt tat gga ace 150 
Asp Glu Phe Gly Tyr Ser Trp Lys Asn lie Arg Glu Arg Tyr Gly Thr 
25 30 35 

eta aca ggc gag ctg cat atg att gaa ctg gag aaa ggt cat agt ggt 198 
Leu Thr Gly Glu Leu His Met He Glu Leu Glu Lys Gly His Ser Gly 
40 45 50 

ttg ggc eta agt ctt get ggg aac aaa gac cga tec agg atg agt gtc 246 
Leu Gly Leu Ser Leu Ala Gly Asn Lys Asp Arg Ser Arg Met Ser Val 
55 60 65 

ttc ata gtg ggg att gat cca aat gga get gca gga aaa gat ggt cga 294 
Phe He Val Gly He Asp Pro Asn Gly Ala Ala Gly Lys Asp Gly Arg 
70 75 80 

ttg caa att gca gat gag ctt eta gag ate aat ggt cag att tta tat 342 
Leu Gin lie Ala Asp Glu Leu Leu Glu He Asn Gly Gin lie Leu Tyr 
85 90 95 100 



gga aga agt cat cag aat gee tea tea ate att aaa tgt gee cct tct 390 
Gly Arg Ser His Gin Asn Ala Ser Ser He He Lys Cys Ala Pro Ser 
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105 110 115 

aaa gtg aaa ata att ttt ate aga aat aaa gat gca gtg aat cag atg 438 
Lys Val Lys He He Phe He Arg Asn Lys Asp Ala Val Asn Gin Met 
120 125 130 

gec gta tgt cct gga aat gca gta gaa cct ttg cct tct aac tea gaa 486 
Ala Val Cys Pro Gly Asn Ala Val Glu Pro Leu Pro Ser Asn Ser Glu 
135 140 145 

aat ctt caa aat aag gag cca gag cca act gtt act act tct gat gca 534 
Asn Leu Gin Asn Lys Glu Pro Glu Pro Thr Val Thr Thr Ser Asp Ala 
150 155 160 

get gtg gac etc agt tea ttt aaa aat gtg caa cat ctg gag ctt ccc 582 
Ala Val Asp Leu Ser Ser Phe Lys Asn Val Gin His Leu Glu Leu Pro 
165 170 175 180 

aag gat cag ggg ggt ttg ggt att get ate age gaa gaa gat aca etc 630 
Lys Asp Gin Gly Gly Leu Gly He Ala He Ser Glu Glu Asp Thr Leu 
185 190 195 

agt gga gtc ate ata aag age tta aca gag cat ggg gta gca gec acg 678 
Ser Gly Val He He Lys Ser Leu Thr Glu His Gly Val Ala Ala Thr 
200 205 210 

gat gga cga etc aaa gtc gga gat cag ata ctg get gta gat gat gaa 726 
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Asp Gly Arg Leu Lys Val Gly Asp Gin He Leu Ala Val Asp Asp Glu 

215 220 225 

att gtt gtt ggt tac cct att gaa aag ttt att age ctt ctg aag aca 774 

lie Val Val Gly Tyr Pro He Glu Lys Phe He Ser Leu Leu Lys Thr 

230 235 240 

gca aag atg aca gta aaa ctt acc ate cat get gag aat cca gat tec 822 

Ala Lys Met Thr Val Lys Leu Thr lie His Ala Glu Asn Pro Asp Ser 

245 250 255 260 

S3, 

^Z: cag get gtt cct tea gca get ggt gca gec agt gga gaa aaa aag aac 870 

Q Gin Ala Val Pro Ser Ala Ala Gly Ala Ala Ser Gly Glu Lys Lys Asn 

Hi 265 270 275 

!™ age tec cag tct ctg atg gtc cca cag tct ggc tec cca gaa ccg gag 918 

HI Ser Ser Gin Ser Leu Met Val Pro Gin Ser Gly Ser Pro Glu Pro Glu 

U 280 285 290 



tec ate cga aat aca age aga tea tea aca cca gca att ttt get tct 966 
Ser He Arg Asn Thr Ser Arg Ser Ser Thr Pro Ala He Phe Ala Ser 
295 300 305 

gat cct gca acc tgc ccc att ate cct ggc tgc gaa aca acc ate gag 1014 
Asp Pro Ala Thr Cys Pro He He Pro Gly Cys Glu Thr Thr He Glu 
310 315 320 
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att tec aaa ggg cga aca ggg ctg ggc ctg age ate gtt ggg ggt tea 1062 
lie Ser Lys Gly Arg Thr Gly Leu Gly Leu Ser lie Val Gly Gly Ser 
325 330 335 340 

gac acg ctg ctg ggt gec ttt att ate cat gaa gtt tat gaa gaa gga 1110 
Asp Thr Leu Leu Gly Ala Phe He He His Glu Val Tyr Glu Glu Gly 
345 350 355 

gca gca tgt aaa gat gga aga etc tgg get gga gat cag ate tta gag 1158 
Ala Ala Cys Lys Asp Gly Arg Leu Trp Ala Gly Asp Gin He Leu Glu 
360 365 370 

gtg aat gga att gac ttg agg aag gee aca cat gat gaa gca ate aat 1206 
Val Asn Gly He Asp Leu Arg Lys Ala Thr His Asp Glu Ala He Asn 
375 380 385 

gtc ctg aga cag acg cca cag aga gtg cgc ctg aca etc tac aga gat 1254 
Val Leu Arg Gin Thr Pro Gin Arg Val Arg Leu Thr Leu Tyr Arg Asp 
390 395 400 

gag gec cca tac aaa gag gag gaa gtg tgt gac ace etc act att gag 1302 
Glu Ala Pro Tyr Lys Glu Glu Glu Val Cys Asp Thr Leu Thr He Glu 
405 410 415 420 

ctg cag aag aag ccg gga aaa ggc eta gga tta agt att gtt ggt aaa 1350 
Leu Gin Lys Lys Pro Gly Lys Gly Leu Gly Leu Ser He Val Gly Lys 
425 430 435 



57/166 



aga aac gat act gga gta ttt gtg tea gac att gtc aaa gga gga att 1398 
Arg Asn Asp Thr Gly Val Phe Val Ser Asp He Val Lys Gly Gly He 
440 445 450 

gca gat ccc gat gga aga ctg ate cag gga gac cag ata tta ttg gtg 1446 
Ala Asp Pro Asp Gly Arg Leu He Gin Gly Asp Gin He Leu Leu Val 
455 460 465 

aat ggg gaa gac gtt cgt aat gee tec caa gaa gcg gtt gec get ttg 1494 
Asn Gly Glu Asp Val Arg Asn Ala Ser Gin Glu Ala Val Ala Ala Leu 
470 475 480 

eta aag tgt tec eta ggc aca gta ace ttg gaa gtt gga aga ate aaa 1542 
Leu Lys Cys Ser Leu Gly Thr Val Thr Leu Glu Val Gly Arg He Lys 
485 490 495 500 

get ggt cca ttc cat tea gag agg agg cca tct caa ace age cag gtg 1590 
Ala Gly Pro Phe His Ser Glu Arg Arg Pro Ser Gin Thr Ser Gin Val 
505 510 515 

agt gaa ggc age ctg tct tct ttc act ttt cca etc tct gga tec agt 1638 
Ser Glu Gly Ser Leu Ser Ser Phe Thr Phe Pro Leu Ser Gly Ser Ser 
520 525 530 

aca tct gag tea ctg gaa agt age tea aag aag aat gca ttg gca tct 1686 
Thr Ser Glu Ser Leu Glu Ser Ser Ser Lys Lys Asn Ala Leu Ala Ser 
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535 

gaa ata cag gga tta aga 
Glu He Gin Gly Leu Arg 
550 

tea ctg gga ate age ate 
Ser Leu Gly He Ser He 
565 570 

gtg cct ata ttt att gca 
Val Pro He Phe He Ala 
585 

acc caa aaa etc aga gtt 
Thr Gin Lys Leu Arg Val 
600 

tec act gag ggc atg act 
Ser Thr Glu Gly Met Thr 
615 

gca tct ggc tec att gaa 
Ala Ser Gly Ser He Glu 
630 

gtg gtc aca ggt cat cat 
Val Val Thr Gly His His 



540 

aca gtc gaa atg aaa aag 
Thr Val Glu Met Lys Lys 
555 560 

get gga gga gta ggc age 
Ala Gly Gly Val Gly Ser 
575 

atg atg cac cca act gga 
Met Met His Pro Thr Gly 
590 

ggg gat agg att gtc acc 
Gly Asp Arg He Val Thr 
605 

cac acc caa gca gtt aac 
His Thr Gin Ala Val Asn 
620 

atg cag gtg gtt get gga 
Met Gin Val Val Ala Gly 
635 640 

cag gag cct gca agt tec 
Gin Glu Pro Ala Ser Ser 



545 

ggc cct act gac 1734 
Gly Pro Thr Asp 

cca ctt ggt gat 1782 
Pro Leu Gly Asp 
580 

gtt gca gca cag 1830 
Val Ala Ala Gin 
595 

ate tgt ggc aca 1878 
He Cys Gly Thr 
610 

eta ctg aaa aat 1926 

Leu Leu Lys Asn 

625 

gga gac gtg agt 1974 
Gly Asp Val Ser 

agt ctt tct ttc 2022 
Ser Leu Ser Phe 
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645 



650 



655 



660 



act ggg ctg acg tea acc agt ata ttt cag gat gat tta gga cct cct 
Thr Gly Leu Thr Ser Thr Ser He Phe Gin Asp Asp Leu Gly Pro Pro 
665 670 675 



2070 



caa tgt aag tct att aca eta gag cga gga cca gat ggc tta ggc ttc 
Gin Cys Lys Ser He Thr Leu Glu Arg Gly Pro Asp Gly Leu Gly Phe 
680 685 690 



2118 



agt ata gtt gga gga tat ggc age cct cat gga gac tta ccc att tat 
Ser He Val Gly Gly Tyr Gly Ser Pro His Gly Asp Leu Pro He Tyr 
695 700 705 



2166 



gtt aaa aca gtg ttt gca aag gga gca gec tct gaa gac gga cgt ctg 
Val Lys Thr Val Phe Ala Lys Gly Ala Ala Ser Glu Asp Gly Arg Leu 
710 715 720 



2214 



aaa agg ggc gat cag ate att get gtc aat ggg cag agt eta gaa gga 
Lys Arg Gly Asp Gin He He Ala Val Asn Gly Gin Ser Leu Glu Gly 
725 730 735 740 



2262 



gtc acc cat gaa gaa get gtt gec ate ctt aaa egg aca aaa ggc act 
Val Thr His Glu Glu Ala Val Ala He Leu Lys Arg Thr Lys Gly Thr 
745 750 755 



2310 



gtc act ttg atg gtt etc tct tgaattggct gecagaattg aaccaaccca 



2361 
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Val Thr Leu Met Val Leu Ser 
760 

acccctagct cacctcctac tgtaaagaga atgcactggt cctgacaatt tttatgctgt 2421 

gttcagccgg gtcttcaaaa ctgtaggggg gaaataacac ttaagtttct ttttctcatc 2481 

tagaaatgct ttccttactg acaacctaac atcatttttc ttttcttctt gcattttgtg 2541 

aacttaaaga gaaggaatat ttgtgtaggt gaatctcgtt tttatttgtg gagatatcta 2601 

atgttttgta gtcacatggg caagaattat tacatgctaa gctggttagt ataaagaaag 2661 

ataattctaa agctaaccaa agaaaatggc ttcagtaagt taggatgaaa aatgaaaata 2721 

taaaataaag aagaaaatct cggggagttt aaaaaaaatg cctcaatttg gcaatctacc 2781 

tcctctcccc accccaaact aaaaaaaaaa aaaaaaaa 2819 

<210> 60 

<211> 25 

<212> DNA 

< 2 1 3 > Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
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<400> 60 

gcccttagga cgcgtaatac gactc 

<210> 61 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 61 

agccagtatc tgatctccga ctttg 

<210> 62 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 

<400> 62 

attttcactt tagaaggggc acat 

<210> 63 „ 



<211> 20 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 63 

ggcataactt tacttacttg 

<210> 64 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 64 

atctactaag tcagcatcat 

<210> 65 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
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<400> 65 

atttgcaggt gtgtagtcat 

<210> 66 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 

<400> 66 

ttccttctgt gctacccgat 

<210> 67 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 

<400> 67 

ggactatctt ccagaacatg 



<210> 68 
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<211> 25 
<212> DNA 

< 2 1 3 > Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 68 

atcgggtcca ttccattcag agagg 

<210> 69 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 69 

aattgtcaag agagaaccat caaagtgg 

<210> 70 
<211> 25 
<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> Artificially Synthesized Primer Sequence 
<400> 7 

atcgatgggt agtaatcaca cacag 25 

<210> 71 
<21 1 > 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 71 

aattgctata ctggatccag agagtgg 27 

<210> 72 
<211> 21 
<212> PET 

<213> Artificial Sequence 

<220> 

<223> Artificially Synthesized Peptide Sequence 
<400> 72 

Val Asp Lys Glu Asp Glu Phe Gly Tyr Ser Trp Lys Asn He Arg Giu 
1 5 10 15 
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Arg Tyr Gly Cys Gly 
20 

<210> 73 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 73 

tttgtgccca ccagagccaa gtcag 

<210> 74 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 

<400> 74 

gtgaaagggg taaaggctta gcaac 

<210> 75 
<211> 1776 
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<212> DNA 

<213> Homo sapience 
<400> 75 

ca att aca cat cag cag get ate age ate ctg cag aaa gec aaa gat 47 
lie Thr His Gin Gin Ala He Ser He Leu Gin Lys Ala Lys Asp 
1 5 10 15 

ace gtc cag eta gtt att gee aga ggc tea ttg cct cag ctt gtc age 95 
Thr Val Gin Leu Val He Ala Arg Gly Ser Leu Pro Gin Leu Val Ser 
20 25 30 

ccc ata gtt tec cgt tct cca tct gca gee age aca att tea get cac 143 
Pro He Val Ser Arg Ser Pro Ser Ala Ala Ser Thr He Ser Ala His 
35 40 45 

tct aat ccg gtt cac tgg caa cac atg gaa acg att gaa ttg gtg aat 191 
Ser Asn Pro Val His Trp Gin His Met Glu Thr He Glu Leu Val Asn 
50 55 60 

gat gga tct ggt ttg gga ttt ggc ate ata gga gga aaa gca act ggt 239 
Asp Gly Ser Gly Leu Gly Phe Gly lie He Gly Gly Lys Ala Thr Gly 
65 70 75 



gtg ata gta aaa ace att ctg cct gga gga gta get gat cag cat ggg 
Val lie Val Lys Thr He Leu Pro Gly Gly Val Ala Asp Gin His Gly 
8 ° 85 90 95 



287 
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cgt tta tgc agt gga gac cac att eta aag att ggt gac aca gat eta 335 
Arg Leu Cys Ser Gly Asp His He Leu Lys He Gly Asp Thr Asp Leu 
100 105 110 

gca gga atg age agt gag caa gta gca caa gtc ctt agg caa tgt gga 383 
Ala Gly Met Ser Ser Glu Gin Val Ala Gin Val Leu Arg Gin Cys Gly 
115 120 125 

aat aga gtt aag ttg atg att gca aga agt gec ata gaa gaa cgt aca 431 
Asn Arg Val Lys Leu Met He Ala Arg Ser Ala He Glu Glu Arg Thr 
130 135 140 

gca ccc act get ttg ggc ate ace etc tec tea tec cca act tea acg 479 
Ala Pro Thr Ala Leu Gly He Thr Leu Ser Ser Ser Pro Thr Ser Thr 
145 150 155 

cca gag ttg egg gtt gat get tct act cag aaa ggt gaa gaa agt gag 527 
Pro Glu Leu Arg Val Asp Ala Ser Thr Gin Lys Gly Glu Glu Ser Glu 
160 165 170 175 

aca ttt gat gta gaa etc act aaa aat gtc caa gga tta gga att ace 575 
Thr Phe Asp Val Glu Leu Thr Lys Asn Val Gin Gly Leu Gly He Thr 
180 185 190 

att get ggc tac att gga gat aaa aaa ttg gaa cct tea gga ate ttt 623 
He Ala Gly Tyr He Gly Asp Lys Lys Leu Glu Pro Ser Gly lie Phe 
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195 200 205 

gta aag age att aca aaa age agt gee gtt gag cat gat gga aga ate 671 
Val Lys Ser He Thr Lys Ser Ser Ala Val Glu His Asp Gly Arg He 
210 215 220 

caa att gga gac caa att ata gca gta gat ggc aca aac ctt cag ggt 719 
Gin He Gly Asp Gin lie He Ala Val Asp Gly Thr Asn Leu Gin Gly 
225 230 235 

ttt act aat cag caa gca gta gag gta ttg cga cat aca gga caa act 767 
Phe Thr Asn Gin Gin Ala Val Glu Val Leu Arg His Thr Gly Gin Thr 
240 245 250 255 

gtg etc ctg aca eta atg agg aga gga atg aag cag gaa gec gag etc 815 
Val Leu Leu Thr Leu Met Arg Arg Gly Met Lys Gin Glu Ala Glu Leu 
260 265 270 

atg tea agg gaa gac gtc aca aaa gat gca gat ttg tct cct gtt aat 863 
Met Ser Arg Glu Asp Val Thr Lys Asp Ala Asp Leu Ser Pro Val Asn 
275 280 285 

gee age ata ate aaa gaa aat tat gaa aaa gat gaa gat ttt tta tct 911 
Ala Ser lie lie Lys Glu Asn Tyr Glu Lys Asp Glu Asp Phe Leu Ser 
290 295 300 

teg acg aga aac ace aac ata tta cca act gaa gaa gaa ggg tat cca 959 
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Ser Thr Arg Asn Thr Asn He Leu Pro Thr Glu Glu Glu Gly Tyr Pro 
305 310 315 

tta ctg tea get gag ata gaa gaa ata gaa gat gca caa aaa caa gaa 1007 
Leu Leu Ser Ala Glu He Glu Glu He Glu Asp Ala Gin Lys Gin Glu 
320 325 330 335 

get get ctg ctg aca aaa tgg caa agg att atg gga att aac tat gaa 1055 
Ala Ala Leu Leu Thr Lys Trp Gin Arg He Met Gly He Asn Tyr Glu 
340 345 350 

ata gtg gtg gec cat gtg age aag ttt agt gag aac agt gga ttg ggg 1103 
He Val Val Ala His Val Ser Lys Phe Ser Glu Asn Ser Gly Leu Gly 
355 360 365 

ata age ctg gaa gcg aca gtg gga cat cat ttt ate cga tct gtt eta 1151 
He Ser Leu Glu Ala Thr Val Gly His His Phe He Arg Ser Val Leu 
370 375 380 

cca gag ggt cct gtt gga cac age ggg aag etc ttc agt gga gac gag 1199 
Pro Glu Gly Pro Val Gly His Ser Gly Lys Leu Phe Ser Gly Asp Glu 
385 390 395 

eta ttg gaa gta aat ggc ata act tta ctt ggg gaa aat cac caa gat 1247 
Leu Leu Glu Val Asn Gly He Thr Leu Leu Gly Glu Asn His Gin Asp 
400 405 410 415 
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gtg gtg aat ate tta aaa gaa ctg cct ata gaa gtg aca atg gtg tgc 1295 
Val Val Asn He Leu Lys Glu Leu Pro He Glu Val Thr Met Val Cys 
420 425 430 

tgt cgt cga act gtg cca ccc acc acc caa tea gaa ttg gat age ctg 1343 
Cys Arg Arg Thr Val Pro Pro Thr Thr Gin Ser Glu Leu Asp Ser Leu 
435 440 445 

gac tta tgt gat att gag eta aca gaa aag cct cac gta gat eta ggt 1391 
Asp Leu Cys Asp lie Glu Leu Thr Glu Lys Pro His Val Asp Leu Gly 
450 455 460 

gag ttc ate ggg tea tea gag cca gag gat cca gtg ctg gcg atg act 1439 
Glu Phe He Gly Ser Ser Glu Pro Glu Asp Pro Val Leu Ala Met Thr 
465 470 475 

gat gcg ggt cag agt aca gaa gag gtt caa gca cct ttg gee atg tgg 1487 
Asp Ala Gly Gin Ser Thr Glu Glu Val Gin Ala Pro Leu Ala Met Trp 
480 485 490 . 495 

gag get ggc att cag cac ata atg ctg gag aaa ggg age aaa gga ctt 1535 
Glu Ala Gly lie Gin His He Met Leu Glu Lys Gly Ser Lys Gly Leu 
500 505 510 



ggt ttt age att tta gat tat cag gat cca att gat cca gca age act 
Gly Phe Ser He Leu Asp Tyr Gin Asp Pro He Asp Pro Ala Ser Thr 
515 520 525 



1583 
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gtg att ata att cgt tct ttg gtg cct ggc ggc att get gaa aag gat 
Val He He He Arg Ser Leu Val Pro Gly Gly He Ala Glu Lys Asp 
530 535 540 

gga cga ctt ctt cct ggt gac cga etc atg ttt gta aac gat gtt aac 
Gly Arg Leu Leu Pro Gly Asp Arg Leu Met Phe Val Asn Asp Val Asn 
545 550 555 

ttg gaa aac age agt ctt gag gaa get gta gaa gca ctg aag gga gca 
Leu Glu Asn Ser Ser Leu Glu Glu Ala Val Glu Ala Leu Lys Gly Ala 
560 565 570 575 



ccg tea ggg act gtg aga ata gga gtt get aag cct tta ccc ctt tc 
Pro Ser Gly Thr Val Arg He Gly Val Ala Lys Pro Leu Pro Leu 
580 585 590 



ac 

<210> 76 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
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<400> 76 

gcagatggag aacgggaaac tatgg 76 

<210> 77 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificially Synthesized Primer Sequence 
<400> 77 

gaacgggaaa ctatggggct gacaa 25 

<210> 78 
<211> 777 
<212> DNA 

<213> Hobo sapience 
<400> 78 

ttctcagtca cgcagttcca ttttaattgc tgttaatcat ttcagagaag aacactgaac 60 

tttgaaaaaa atg ttg gaa gcc att gac aaa aat egg gec ctg cat gca 109 
Met Leu Glu Ala He Asp Lys Asn Arg Ala Leu His Ala 
1 5 10 
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gca gag cgc ttg caa acc aag ctg cga gaa cgt ggg gat gta gca aat 157 

Ala Glu Arg Leu Gin Thr Lys Leu Arg Glu Arg Gly Asp Val Ala Asn 
15 20 25 

gaa gac aaa ctg age ctt ctg aag tea gtc ctg cag age cct etc ttc 205 

Glu Asp Lys Leu Ser Leu Leu Lys Ser Val Leu Gin Ser Pro Leu Phe 

30 35 40 45 

agt cag att ctg age ctt cag act tct gta cag cag ctg aaa gac cag 253 

Ser Gin lie Leu Ser Leu Gin Thr Ser Val Gin Gin Leu Lys Asp Gin 

50 55 60 

gta aat att gca act tea gca act tea aat att gaa tat gec cac gtt 301 

Val Asn He Ala Thr Ser Ala Thr Ser Asn He Glu Tyr Ala His Val 
65 70 75 

cct cat etc age cca get gtg att cct act ctg caa aat gaa teg ttt 349 

Pro His Leu Ser Pro Ala Val He Pro Thr Leu Gin Asn Glu Ser Phe 
80 85 90 

tta tta tec cca aac aat ggg aat ctg gaa gca ctt aca gga cct ggt 397 

Leu Leu Ser Pro Asn Asn Gly Asn Leu Glu Ala Leu Thr Gly Pro Gly 
95 100 105 

att cca cac att aat ggg aaa cct get tgt gat gaa ttt gat cag ctt 445 

He Pro His lie Asn Gly Lys Pro Ala Cys Asp Glu Phe Asp Gin Leu 

110 115 120 125 
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ate aaa aat atg gec cag ggt cgc cat gta gaa gtt ttt gag etc etc 493 
He Lys Asn Met Ala Gin Gly Arg His Val Glu Val Phe Glu Leu Leu 
130 135 140 

aaa cct cca tct gga ggc ctt ggg ttt agt gtt gtg gga eta aga agt 541 
Lys Pro Pro Ser Gly Gly Leu Gly Phe Ser Val Val Gly Leu Arg Ser 
145 150 155 

gaa aac aga gga gag ctg gga ata ttt gtt caa gag ata caa gag ggc 589 
Glu Asn Arg Gly Glu Leu Gly He Phe Val Gin Glu He Gin Glu Gly 
160 165 170 

agt gtg gec cat aga gat gga aga ttg aaa gaa act gat caa att ctt 637 
Ser Val Ala His Arg Asp Gly Arg Leu Lys Glu Thr Asp Gin He Leu 
175 180 185 

get ate aat gga cag get ctt gat cag aca att aca cat cag cag get 685 
Ala He Asn Gly Gin Ala Leu Asp Gin Thr He Thr His Gin Gin Ala 
190 195 200 205 

ate age ate ctg cag aaa gee aaa gat act gtc cag eta gtt att gec 733 
He Ser He Leu Gin Lys Ala Lys Asp Thr Val Gin Leu Val He Ala 
210 215 220 



aga ggc tea ttg cct cag ctt gtc age ccc ata gtt tec cgt 
Arg Gly Ser Leu Pro Gin Leu Val Ser Pro He Val Ser Arg 



775 
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225 230 235 



tc 

<210> 79 
<211> 755 
<212> MA 

<213> Homo sapience 
<400> 79 

tt cct tct gtg eta ccc gat tea get gga aag ggc tct gag tac ctg 
Pro Ser Val Leu Pro Asp Ser Ala Gly Lys Gly Ser Glu Tyr Leu 
15 10 15 

ctt gaa cag age tec ctg gee tgt aat get gag tgt gtc atg ctt caa 
Leu Glu Gin Ser Ser Leu Ala Cys Asn Ala Glu Cys Val Met Leu Gin 
20 25 30 

aat gta tct aaa gaa tct ttt gaa agg act att aat ata gca aaa ggc 
Asn Val Ser Lys Glu Ser Phe Glu Arg Thr lie Asn He Ala Lys Gly 
35 40 45 

aat tct age eta gga atg aca gtt agt get aat aaa gat ggc ttg ggg 
Asn Ser Ser Leu Gly Met Thr Val Ser Ala Asn Lys Asp Gly Leu Gly 
50 55 60 
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atg ate gtt cga age att att cat gga ggt gec att agt cga gat ggc 239 
Met He Val Arg Ser He He His Gly Gly Ala lie Ser Arg Asp Gly 
65 70 75 

egg att gec att ggg gac tgc ate ttg tec att aat gaa gag tct acc 287 
Arg He Ala He Gly Asp Cys He Leu Ser He Asn Glu Glu Ser Thr 
80 85 90 95 

ate agt gta acc aat gee cag gca cga get atg ttg aga aga cat tct 335 
He Ser Val Thr Asn Ala Gin Ala Arg Ala Met Leu Arg Arg His Ser 
100 105 110 

etc att ggc cct gac ata aaa att act tat gtg cct gca gaa cat ttg 383 
Leu He Gly Pro Asp lie Lys He Thr Tyr Val Pro Ala Glu His Leu 
115 120 125 

gaa gag ttc aaa ata age ttg gga caa caa tct gga aga gta atg gca 431 
Glu Glu Phe Lys He Ser Leu Gly Gin Gin Ser Gly Arg Val Met Ala 
130 135 140 

ctg gat att ttt tct tea tac act ggc aga gac att cca gaa tta cca 479 
Leu Asp He Phe Ser Ser Tyr Thr Gly Arg Asp He Pro Glu Leu Pro 
145 150 155 

gag cga gaa gag gga gag ggt gaa gaa age gaa ctt caa aac aca gca 527 
Glu Arg Glu Glu Gly Glu Gly Glu Glu Ser Glu Leu Gin Asn Thr Ala 
160 165 170 175 
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tat age aat tgg aat cag ccc agg egg gtg gaa etc tgg aga gaa cca 575 
Tyr Ser Asn Trp Asn Gin Pro Arg Arg Val Glu Leu Trp Arg Glu Pro 
180 185 190 

age aaa tec tta ggc ate age att gtt ggt gga cga ggg atg ggg agt 623 
Ser Lys Ser Leu Gly He Ser lie Val Gly Gly Arg Gly Met Gly Ser 
195 200 205 

egg eta age aat gga gaa gtg atg agg ggc att ttc ate aaa cat gtt 671 
Arg Leu Ser Asn Gly Glu Val Met Arg Gly lie Phe He Lys His Val 
210 215 220 

ctg gaa gat agt cca get ggc aaa aat gga acc ttg aaa cct gga gat 719 
Leu Glu Asp Ser Pro Ala Gly Lys Asn Gly Thr Leu Lys Pro Gly Asp 
225 230 235 

aga ate gta gag gca ccc agt cag tea gag tea gag 755 
Arg He Val Glu Ala Pro Ser Gin Ser Glu Ser Glu 
240 245 250 

<210> 80 

<211> 865 

<212> DNA 

< 2 1 3 > Homo sapience 



<400> 80 
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tt cct tct gtg eta ccc gat tea get gga aag ggc tct gag tac ctg 
Pro Ser Val Leu Pro Asp Ser Ala Gly Lys Gly Ser Glu Tyr Leu 
1 5 10 15 



47 



ctt gaa cag age tec ctg gee tgt aat get gag tgt gtc atg ctt caa 
Leu Glu Gin Ser Ser Leu Ala Cys Asn Ala Glu Cys Val Met Leu Gin 
20 25 30 



95 



aat gta tct aaa gaa tct ttt gaa agg act att aat ata gca aaa ggc 
Asn Val Ser Lys Glu Ser Phe Glu Arg Thr He Asn He Ala Lys Gly 
35 40 45 



143 



aat tct age eta gga atg aca gtt agt get aat aaa gat ggc ttg ggg 
Asn Ser Ser Leu Gly Met Thr Val Ser Ala Asn Lys Asp Gly Leu Gly 
50 55 60 



191 



atg ate gtt cga age att att cat gga ggt gee att agt cga gat ggc 
Met He Val Arg Ser He He His Gly Gly Ala He Ser Arg Asp Gly 
65 70 75 



239 



egg att gee att ggg gac tgc ate ttg tec att aat gaa gag tct acc 
Arg He Ala He Gly Asp Cys He Leu Ser He Asn Glu Glu Ser Thr 
80 85 90 95 



287 



ate agt gta acc aat gee cag gca cga get atg ttg aga aga cat tct 
He Ser Val Thr Asn Ala Gin Ala Arg Ala Met Leu Arg Arg His Ser 
100 105 110 



335 
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etc att ggc cct gac ata aaa att act tat gtg cct gca gaa cat ttg 383 

Leu He Gly Pro Asp He Lys He Thr Tyr Val Pro Ala Glu His Leu 
115 120 125 

gaa gag ttc aaa ata age ttg gga caa caa tct gga aga gta atg gca 431 

Glu Glu Phe Lys He Ser Leu Gly Gin Gin Ser Gly Arg Val Met Ala 
130 135 140 

ctg gat att ttt tct tea tac act ggc aga gac att cca gaa tta cca 479 

Leu Asp He Phe Ser Ser Tyr Thr Gly Arg Asp lie Pro Glu Leu Pro 
145 150 155 

gag cga gaa gag gga gag ggt gaa gaa age gaa ctt caa aac aca gca 527 

Glu Arg Glu Glu Gly Glu Gly Glu Glu Ser Glu Leu Gin Asn Thr Ala 
160 165 170 175 

tat age aat tgg aat cag ccc agg egg gtg gaa etc tgg aga gaa cca 575 

Tyr Ser Asn Trp Asn Gin Pro Arg Arg Val Glu Leu Trp Arg Glu Pro 
180 185 190 

age aaa tec tta ggc ate age att gtt ggt gga cga ggg atg ggg agt 623 

Ser Lys Ser Leu Gly He Ser He Val Gly Gly Arg Gly Met Gly Ser 
195 200 205 

"egg eta age aat gga gaa gtg atg agg ggc att ttc ate aaa cat gtt 671 

Arg Leu Ser Asn Gly Glu Val Met Arg Gly He Phe He Lys His Val 
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210 215 

ctg gaa gat agt cca get ggc aaa aat 
Leu Glu Asp Ser Pro Ala Gly Lys Asn 
225 230 

aga ate gta gag gtg gat gga atg gac 
Arg He Val Glu Val Asp Gly Met Asp 
240 245 

caa get gtg gaa gec att egg aaa gca 
Gin Ala Val Glu Ala He Arg Lys Ala 
260 

gta tag age ttt att aca gac caa ggg 
Val 

275 280 

ag 

<210> 81 
<211> 965 
<212> DNA 

<213> Homo sapience 
<400> 81 



220 

gga ace ttg aaa cct gga gat 719 

Gly Thr Leu Lys Pro Gly Asp 
235 

etc aga gat gca age cat gaa 767 

Leu Arg Asp Ala Ser His Glu 
250 255 

ggc aac cct gta gtc ttt atg 815 
Gly Asn Pro Val Val Phe Met 
265 270 

cac cca gtc agt cag agt cag 863 

285 

865 



tt cct tct gtg eta ccc gat tea get gga aag ggc tct gag tac ctg 



47 
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Pro Ser Val Leu Pro Asp Ser Ala Gly Lys Gly Ser Glu Tyr Leu 
15 10 15 

ctt gaa cag age tec ctg gec tgt aat get gag tgt gtc atg ctt caa 95 
Leu Glu Gin Ser Ser Leu Ala Cys Asn Ala Glu Cys Val Met Leu Gin 
20 25 30 

aat gta tct aaa gaa tct ttt gaa agg act att aat ata gca aaa ggc 143 
Asn Val Ser Lys Glu Ser Phe Glu Arg Thr He Asn lie Ala Lys Gly 
35 40 45 

aat tct age eta gga atg aca gtt agt get aat aaa gat ggc ttg ggg 191 
Asn Ser Ser Leu Gly Met Thr Val Ser Ala Asn Lys Asp Gly Leu Gly 
50 55 60 

atg ate gtt cga age att att cat gga ggt gee att agt cga gat ggc 239 
Met He Val Arg Ser He He His Gly Gly Ala He Ser Arg Asp Gly 
65 70 75 

egg att gec att ggg gac tgc ate ttg tec att aat gaa gag tct acc 287 
Arg He Ala lie Gly Asp Cys lie Leu Ser He Asn Glu Glu Ser Thr 
80 85 90 95 

ate agt gta acc aat gee cag gca cga get atg ttg aga aga cat tct 335 
He Ser Val Thr Asn Ala Gin Ala Arg Ala Met Leu Arg Arg His Ser 
100 105 110 
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etc att ggc cct gac ata aaa att act tat gtg cct gca gaa cat ttg 383 
Leu lie Gly Pro Asp He Lys lie Thr Tyr Val Pro Ala Glu His Leu 
115 120 125 

gaa gag ttc aaa ata age ttg gga caa caa tct gga aga gta atg gca 431 
Glu Glu Phe Lys He Ser Leu Gly Gin Gin Ser Gly Arg Val Met Ala 
130 135 140 

ctg gat att ttt tct tea tac act ggc aga gac att cca gaa tta cca 479 
Leu Asp He Phe Ser Ser Tyr Thr Gly Arg Asp He Pro Glu Leu Pro 
145 150 155 

gag cga gaa gag gga gag ggt gaa gaa age gaa ctt caa aac aca gca 527 
Glu Arg Glu Glu Gly Glu Gly Glu Glu Ser Glu Leu Gin Asn Thr Ala 
160 165 170 175 

tat age aat tgg aat cag ccc agg egg gtg gaa etc tgg aga gaa cca 575 
Tyr Ser Asn Trp Asn Gin Pro Arg Arg Val Glu Leu Trp Arg Glu Pro 
180 185 190 

age aaa tec tta ggc ate age att gtt ggt gga cga ggg atg ggg agt 623 
Ser Lys Ser Leu Gly He Ser He Val Gly Gly Arg Gly Met Gly Ser 
195 200 205 

egg eta age aat gga gaa gtg atg agg ggc att ttc ate aaa cat gtt 671 
Arg Leu Ser Asn Gly Glu Val Met Arg Gly lie Phe He Lys His Val 
210 215 220 
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ctg gaa gat agt cca get ggc aaa aat gga acc ttg aaa cct gga gat 719 
Leu Glu Asp Ser Pro Ala Gly Lys Asn Gly Thr Leu Lys Pro Gly Asp 
225 230 235 

aga ate gta gag gtg gat gga atg gac etc aga gat gca age cat gaa 767 
Arg He Val Glu Val Asp Gly Met Asp Leu Arg Asp Ala Ser His Glu 
240 245 250 255 

caa get gtg gaa gec att egg aaa gca ggc aac cct gta gtc ttt atg 815 
Gin Ala Val Glu Ala He Arg Lys Ala Gly Asn Pro Val Val Phe Met 
260 265 270 

gta cag age att ata aac aga cca agg aaa tec cct ttg cct tec ttg 863 
Val Gin Ser He He Asn Arg Pro Arg Lys Ser Pro Leu Pro Ser Leu 
275 280 285 

ctg cac aac ctt tac cct aag tac aac ttc age age act aac cca ttt 911 
Leu His Asn Leu Tyr Pro Lys Tyr Asn Phe Ser Ser Thr Asn Pro Phe 
290 295 300 

get gac tct eta caa ate aac gec gac aag gca ccc agt cag tea gag 959 
Ala Asp Ser Leu Gin He Asn Ala Asp Lys Ala Pro Ser Gin Ser Glu 
305 310 315 

tea gag 965 
Ser Glu 



85/166 



320 

<210> 82 
<211> 2000 
<212> PRT 

< 2 1 3 > Homo sapience 
<400> 82 

Met Leu Glu Ala He Asp Lys Asn Arg Ala Leu His Ala 
1 5 10 

Ala Glu Arg Leu Gin Thr Lys Leu Arg Glu Arg Gly Asp Val Ala Asn 

15 20 25 

Glu Asp Lys Leu Ser Leu Leu Lys Ser Val Leu Gin Ser Pro Leu Phe 
30 35 40 45 

Ser Gin He Leu Ser Leu Gin Thr Ser Val Gin Gin Leu Lys Asp Gin 

50 55 60 

Val Asn He Ala Thr Ser Ala Thr Ser Asn He Glu Tyr Ala His Val 

65 70 75 

Pro His Leu Ser Pro Ala Val He Pro Thr Leu Gin Asn Glu Ser Phe 

80 85 90 

Leu Leu Ser Pro Asn Asn Gly Asn Leu Glu Ala Leu Thr Gly Pro Gly 

95 100 105 

He Pro His He Asn Gly Lys Pro Ala Cys Asp Glu Phe Asp Gin Leu 
110 115 120 125 

lie Lys Asn Met Ala Gin Gly Arg His Val Glu Val Phe Glu Leu Leu 

130 135 140 

Lys Pro Pro Ser Gly Gly Leu Gly Phe Ser Val Val Gly Leu Arg Ser 
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145 150 155 

Glu Asn Arg Gly Glu Leu Gly He Phe Val Gin Glu He Gin Glu Gly 

160 165 170 

Ser Val Ala His Arg Asp Gly Arg Leu Lys Glu Thr Asp Gin He Leu 

175 180 185 

Ala He Asn Gly Gin Ala Leu Asp Gin Thr He Thr His Gin Gin Ala 
190 195 200 205 

He Ser He Leu Gin Lys Ala Lys Asp Thr Val Gin Leu Val He Ala 

210 215 220 

Arg Gly Ser Leu Pro Gin Leu Val Ser Pro He Val Ser Arg Ser Pro 

225 230 235 

Ser Ala Ala Ser Thr He Ser Ala His Ser Asn Pro Val His Trp Gin 

240 245 250 

His Met Glu Thr He Glu Leu Val Asn Asp Gly Ser Gly Leu Gly Phe 

255 260 265 

Gly He lie Gly Gly Lys Ala Thr Gly Val He Val Lys Thr He Leu 
270 275 280 285 

Pro Gly Gly Val Ala Asp Gin His Gly Arg Leu Cys Ser Gly Asp His 

290 295 . 300 

He Leu Lys lie Gly Asp Thr Asp Leu Ala Gly Met Ser Ser Glu Gin 

305 310 315 

Val Ala Gin Val Leu Arg Gin Cys Gly Asn Arg Val Lys Leu Met He 

320 325 330 

Ala Arg Ser Ala He Glu Glu Arg Thr Ala Pro Thr Ala Leu Gly He 

335 340 345 

Thr Leu Ser Ser Ser Pro Thr Ser Thr Pro Glu Leu Arg Val Asp Ala 
350 355 360 365 
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Ser Thr Gin Lys Gly Glu Glu Ser Glu Thr Phe Asp Val Glu Leu Thr 

370 375 380 

Lys Asn Val Gin Gly Leu Gly He Thr lie Ala Gly Tyr He Gly Asp 

385 390 395 

Lys Lys Leu Glu Pro Ser Gly He Phe Val Lys Ser He Thr Lys Ser 

400 405 410 

Ser Ala Val Glu His Asp Gly Arg He Gin He Gly Asp Gin He He 

415 420 425 

Ala Val Asp Gly Thr Asn Leu Gin Gly Phe Thr Asn Gin Gin Ala Val 
430 435 440 445 

Glu Val Leu Arg His Thr Gly Gin Thr Val Leu Leu Thr Leu Met Arg 

450 455 460 

Arg Gly Met Lys Gin Glu Ala Glu Leu Met Ser Arg Glu Asp Val Thr 

465 470 475 

Lys Asp Ala Asp Leu Ser Pro Val Asn Ala Ser lie He Lys Glu Asn 

480 485 490 

Tyr Glu Lys Asp Glu Asp Phe Leu Ser Ser Thr Arg Asn Thr Asn He 

495 500 505 

Leu Pro Thr Glu Glu Glu Gly Tyr Pro Leu Leu Ser Ala Glu He Glu 
510 515 520 525 

Glu He Gltf Asp Ala Gin Lys Gin Glu Ala Ala Leu Leu Thr Lys Trp 

530 535 540 

Gin Arg He Met Gly He Asn Tyr Glu He Val Val Ala His Val Ser 

545 550 555 

Lys Phe Ser Glu Asn Ser Gly Leu Gly He Ser Leu Glu Ala Thr Val 

560 565 570 

Gly His His Phe He Arg Ser Val Leu Pro Glu Gly Pro Val Gly His 
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575 580 585 

Ser Gly Lys Leu Phe Ser Gly Asp Glu Leu Leu Glu Val Asn Gly He 
590 595 600 605 

Thr Leu Leu Gly Glu Asn His Gin Asp Val Val Asn He Leu Lys Glu 

610 615 620 

Leu Pro He Glu Val Thr Met Val Cys Cys Arg Arg Thr Val Pro Pro 

625 630 635 

Thr Thr Gin Ser Glu Leu Asp Ser Leu Asp Leu Cys Asp He Glu Leu 

640 645 650 

Thr Glu Lys Pro His Val Asp Leu Gly Glu Phe lie Gly Ser Ser Glu 

655 660 665 

Pro Glu Asp Pro Val Leu Ala Met Thr Asp Ala Gly Gin Ser Thr Glu 
670 675 ^ 680 685 

Glu Val Gin Ala Pro Leu Ala Met Trp Glu Ala Gly He Gin His He 

690 695 700 

Met Leu Glu Lys Gly Ser Lys Gly Leu Gly Phe Ser He Leu Asp Tyr 

705 710 715 

Gin Asp Pro He Asp Pro Ala Ser Thr Val He He lie Arg Ser Leu 

720 725 730 

Val Pro Gly Gly He Ala Glu Lys Asp Gly Arg Leu Leu Pro Gly Asp 

735 740 745 

Arg Leu Met Phe Val Asn Asp Val Asn Leu Glu Asn Ser Ser Leu Glu 
750 755 760 765 

Glu Ala Val Glu Ala Leu Lys Gly Ala Pro Ser Gly Thr Val Arg He 

770 775 780 

Gly Val Ala Lys Pro Leu Pro Leu Ser Pro Glu Glu Gly Tyr Val Ser 
785 790 795 
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Ala Lys Glu Asp Ser Phe Leu Tyr Pro Pro His Ser Cys Glu Glu Ala 

800 805 810 

Gly Leu Ala Asp Lys Pro Leu Phe Arg Ala Asp Leu Ala Leu Val Gly 

815 820 825 

Thr Asn Asp Ala Asp Leu Val Asp Glu Ser Thr Phe Glu Ser Pro Tyr 
830 835 840 845 

Ser Pro Glu Asn Asp Ser lie Tyr Ser Thr Gin Ala Ser He Leu Ser 

850 855 860 

Leu His Gly Ser Ser Cys Gly Asp Gly Leu Asn Tyr Gly Ser Ser Leu 

865 870 875 

Pro Ser Ser Pro Pro Lys Asp Val He Glu Asn Ser Cys Asp Pro Val 

880 885 890 

Leu Asp Leu His Met Ser Leu Glu Glu Leu Tyr Thr Gin Asn Leu Leu 

895 900 905 

Glu Arg Gin Asp Glu Asn Thr Pro Ser Val Asp He Ser Met Gly Pro 
910 915 920 925 

Ala Ser Gly Phe Thr He Asn Asp Tyr Thr Pro Ala Asn Ala lie Glu 

930 935 940 

Gin Gin Tyr Glu Cys Glu Asn Thr He Val Trp Thr Glu Ser His Leu 

945 950 955 

Pro Ser Glu Val He Ser Ser Ala Glu Leu Pro Ser Val Leu Pro Asp 

960 965 970 

Ser Ala Gly Lys Gly Ser Glu His Leu Leu Glu Gin Ser Ser Leu Ala 

975 980 985 

Cys Asn Ala Glu Cys Val Met Leu Gin Asn Val Ser Lys Glu Ser Phe 
990 995 1000 1005 

Glu Arg Thr lie Asn He Ala Lys Gly Asn Ser Ser Leu Gly Met Thr 
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1010 1015 1020 

Val Ser Ala Asn Lys Asp Gly Leu Gly Met He Val Arg Ser He He 

1025 1030 1035 

His Gly Gly Ala He Ser Arg Asp Gly Arg He Ala He Gly Asp Cys 

1040 1045 1050 

He Leu Ser He Asn Glu Glu Ser Thr He Ser Val Thr Asn Ala Gin 

1055 1060 1065 

Ala Arg Ala Met Leu Arg Arg His Ser Leu He Gly Pro Asp He Lys 
1070 1075 1080 1085 

lie Thr Tyr Val Pro Ala Glu His Leu Glu Glu Phe Lys He Ser Leu 

1090 1095 1100 

Gly Gin Gin Ser Gly Arg Val Met Ala Leu Asp lie Phe Ser Ser Tyr 

1105 1110 1115 

Thr Gly Arg Asp He Pro Glu Leu Pro Glu Arg Glu Glu Gly Glu Gly 

1120 1125 1130 

Glu Glu Ser Glu Leu Gin Asn Thr Ala Tyr Ser Asn Trp Asn Gin Pro 

1135 1140 1145 

Arg Arg Val Glu Leu Trp Arg Glu Pro Ser Lys Ser Leu Gly He Ser 
1150 1155 1160 1165 

He Val Gly Gly Arg Gly Met Gly Ser Arg Leu Ser Asn Gly Glu Val 

1170 1175 1180 

Met Arg Gly He Phe He Lys His Val Leu Glu Asp Ser Pro Ala Gly 

1185 1190 1195 

Lys Asn Gly Thr Leu Lys Pro Gly Asp Arg lie Val Glu Ala Pro Ser 

1200 1205 1210 

Gin Ser Glu Ser Glu Pro Glu Lys Ala Pro Leu Cys Ser Val Pro Pro 
1215 1220 1225 
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Pro Pro Pro Ser Ala Phe Ala Glu Met Gly Ser Asp His Thr Gin Ser 
1230 1235 1240 1245 

Ser Ala Ser Lys He Ser Gin Asp Val Asp Lys Glu Asp Glu Phe Gly 

1250 1255 1260 

Tyr Ser Trp Lys Asn He Arg Glu Arg Tyr Gly Thr Leu Thr Gly Glu 

1265 1270 1275 

Leu His Met He Glu Leu Glu Lys Gly His Ser Gly Leu Gly Leu Ser 

1280 1285 1290 

Leu Ala Gly Asn Lys Asp Arg Ser Arg Met Ser Val Phe He Val Gly 

1295 1300 1305 

He Asp Pro Asn Gly Ala Ala Gly Lys Asp Gly Arg Leu Gin He Ala 
1310 1315 1320 1325 

Asp Glu Leu Leu Glu He Asn Gly Gin He Leu Tyr Gly Arg Ser His 

1330 1335 1340 

Gin Asn Ala Ser Ser He He Lys Cys Ala Pro Ser Lys Val Lys He 

1345 1350 1355 

He Phe He Arg Asn Lys Asp Ala Val Asn Gin Met Ala Val Cys Pro 

1360 1365 1370 

Gly Asn Ala Val Glu Pro Leu Pro Ser Asn Ser Glu Asn Leu Gin Asn 

1375 1380 1385 

Lys Glu Thr Glu Pro Thr Val Thr Thr Ser Asp Ala Ala Val Asp Leu 
1390 1395 1400 1405 

Ser Ser Phe Lys Asn Val Gin His Leu Glu Leu Pro Lys Asp Gin Gly 

1410 1415 1420 

Gly Leu Gly He Ala He Ser Glu Glu Asp Thr Leu Ser Gly Val He 

1425 1430 1435 

He Lys Ser Leu Thr Glu His Gly Val Ala Ala Thr Asp Gly Arg Leu 
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1440 1445 1450 

Lys Val Gly Asp Gin He Leu Ala Val Asp Asp Glu He Val Val Gly 

1455 1460 1465 

Tyr Pro He Glu Lys Phe He Ser Leu Leu Lys Thr Ala Lys Met Thr 
1470 1475 1480 1485 

Val Lys Leu Thr He His Ala Glu Asn Pro Asp Ser Gin Ala Val Pro 

1490 1495 1500 

Ser Ala Ala Gly Ala Ala Ser Gly Glu Lys Lys Asn Ser Ser Gin Ser 

1505 1510 1515 

Leu Met Val Pro Gin Ser Gly Ser Pro Glu Pro Glu Ser He Arg Asn 

~ 1520 1525 1530 

Thr Ser Arg Ser Ser Thr Pro Ala He Phe Ala Ser Asp Pro Ala Thr 

1535 1540 1545 

Cys Pro He He Pro Gly Cys Glu Thr Thr lie Glu He Ser Lys Gly 
1550 1555 1560 1565 

Arg Thr Gly Leu Gly Leu Ser He Val Gly Gly Ser Asp Thr Leu Leu 

1570 1575 1580 

Gly Ala Phe He lie His Glu Val Tyr Glu Glu Gly Ala Ala Cys Lys 

1585 1590 1595 

Asp Gly ArgpLeu Trp Ala Gly Asp Gin He Leu Glu Val Asn Gly lie 

1600 1605 1610 

Asp Leu Arg Lys Ala Thr His Asp Glu Ala He Asn Val Leu Arg Gin 

1615 1620 1625 

Thr Pro Gin Arg Val Arg Leu Thr Leu Tyr Arg Asp Glu Ala Pro Tyr 
1630 1635 1640 1645 

Lys Glu Glu Glu Val Cys Asp Thr Leu Thr He Glu Leu Gin Lys Lys 
1650 1655 1660 
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Pro Gly Lys Gly Leu Gly Leu Ser lie Val Gly Lys Arg Asn Asp Thr 

1665 1670 1675 

Gly Val Phe Val Ser Asp He Val Lys Gly Gly He Ala Asp Pro Asp 

1680 1685 1690 

Gly Arg Leu He Gin Gly Asp Gin He Leu Leu Val Asn Gly Glu Asp 

1695 1700 1705 

Val Arg Asn Ala Ser Gin Glu Ala Val Ala Ala Leu Leu Lys Cys Ser 
1710 1715 1720 1725 

Leu Gly Thr Val Thr Leu Glu Val Gly Arg He Lys Ala Gly Pro Phe 

1730 1735 1740 

His Ser Glu Arg Arg Pro Ser Gin Thr Ser Gin Val Ser Glu Gly Ser 

1745 1750 1755 

Leu Ser Ser Phe Thr Phe Pro Leu Ser Gly Ser Ser Thr Ser Glu Ser 

1760 1765 1770 

Leu Glu Ser Ser Ser Lys Lys Asn Ala Leu Ala Ser Glu He Gin Gly 

1775 1780 1785 

Leu Arg Thr Val Glu Met Lys Lys Gly Pro Thr Asp Ser Leu Gly He 
1790 1795 1800 1805 

Ser He Ala Gly Gly Val Gly Ser Pro Leu Gly Asp Val Pro He Phe 

1810 1815 1820 

lie Ala Met Het His Pro Thr Gly Val Ala Ala Gin Thr Gin Lys Leu 

1825 1830 1835 

Arg Val Gly Asp Arg He Val Thr He Cys Gly Thr Ser Thr Glu Gly 

1840 1845 1850 

Het Thr His Thr Gin Ala Val Asn Leu Leu Lys Asn Ala Ser Gly Ser 

1855 1860 1865 

lie Glu Met Gin Val Val Ala Gly Gly Asp Val Ser Val Val Thr Gly 
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1870 1875 1880 1885 

His His Gin Glu Pro Ala Ser Ser Ser Leu Ser Phe Thr Gly Leu Thr 

1890 1895 1900 

Ser Thr Ser He Phe Gin Asp Asp Leu Gly Pro Pro Gin Cys Lys Ser 

1905 1910 1915 

He Thr Leu Glu Arg Gly Pro Asp Gly Leu Gly Phe Ser He Val Gly 

1920 1925 1930 

Gly Tyr Gly Ser Pro His Gly Asp Leu Pro He Tyr Val Lys Thr Val 

1935 1940 1945 

Phe Ala Lys Gly Ala Ala Ser Glu Asp Gly Arg Leu Lys Arg Gly Asp 
1950 1955 1960 1965 

Gin He He Ala Val Asn Gly Gin Ser Leu Glu Gly Val Thr His Glu 

1970 1975 1980 

Glu Ala Val Ala He Leu Lys Arg Thr Lys Gly Thr Val Thr Leu Met 
1985 1990 1995 

Val Leu Ser 
2000 

<210> 83 
<211> 2070 
<212> PET 

<213> Homo sapience 
<400> 83 

Met Leu Glu Ala He Asp Lys Asn Arg Ala Leu His Ala 
1 5 10 

Ala Glu Arg Leu Gin Thr Lys Leu Arg Glu Arg Gly Asp Val Ala Asn 
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15 20 25 

G lu Asp Lys Leu Ser Leu Leu Lys Ser Val Leu Gin Ser Pro Leu Phe 
30 35 40 45 

Ser Gin He Leu Ser Leu Gin Thr Ser Val Gin Gin Leu Lys Asp Gin 

50 55 60 

Val Asn lie Ala Thr Ser Ala Thr Ser Asn He Glu Tyr Ala His Val 

65 70 75 

Pro His Leu Ser Pro Ala Val He Pro Thr Leu Gin Asn Glu Ser Phe 

80 85 90 

Leu Leu Ser Pro Asn Asn Gly Asn Leu Glu Ala Leu Thr Gly Pro Gly 

95 100 105 

He Pro His He Asn Gly Lys Pro Ala Cys Asp Glu Phe Asp Gin Leu 
110 115 120 125 

He Lys Asn Met Ala Gin Gly Arg His Val Glu Val Phe Glu Leu Leu 

130 135 140 

Lys Pro Pro Ser Gly Gly Leu Gly Phe Ser Val Val Gly Leu Arg Ser 

145 150 155 

Glu Asn Arg Gly Glu Leu Gly lie Phe Val Gin Glu He Gin Glu Gly 

160 165 170 

Ser Val Ala His Arg Asp Gly Arg Leu Lys Glu Thr Asp Gin He Leu 

175 180 185 

Ala He Asn Gly Gin Ala Leu Asp Gin Thr He Thr His Gin Gin Ala 
190 195 200 205 

He Ser He Leu Gin Lys Ala Lys Asp Thr Val Gin Leu Val He Ala 

210 215 220 

Arg Gly Ser Leu Pro Gin Leu Val Ser Pro He Val Ser Arg Ser Pro 
225 230 235 
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Ser Ala Ala Ser Thr He Ser Ala His Ser Asn Pro Val His Trp Gin 

240 245 250 

His Met Glu Thr He Glu Leu Val Asn Asp Gly Ser Gly Leu Gly Phe 

255 260 265 

Gly lie He Gly Gly Lys Ala Thr Gly Val He Val Lys Thr He Leu 
270 275 280 285 

Pro Gly Gly Val Ala Asp Gin His Gly Arg Leu Cys Ser Gly Asp His 

290 295 300 

He Leu Lys He Gly Asp Thr Asp Leu Ala Gly Met Ser Ser Glu Gin 

305 310 315 

Val Ala Gin Val Leu Arg Gin Cys Gly Asn Arg Val Lys Leu Met He 

320 325 330 

Ala Arg Ser Ala He Glu Glu Arg Thr Ala Pro Thr Ala Leu Gly He 

335 340 345 

Thr Leu Ser Ser Ser Pro Thr Ser Thr Pro Glu Leu Arg Val Asp Ala 
350 355 360 365 

Ser Thr Gin Lys Gly Glu Glu Ser Glu Thr Phe Asp Val Glu Leu Thr 

370 375 380 

Lys Asn Val Gin Gly Leu Gly He Thr He Ala Gly Tyr lie Gly Asp 

385 390 395 

Lys Lys Leu. Glu Pro Ser Gly He Phe Val Lys Ser He Thr Lys Ser 

400 405 410 

Ser Ala Val Glu His Asp Gly Arg He Gin He Gly Asp Gin lie He 

415 420 425 

Ala Val Asp Gly Thr Asn Leu Gin Gly Phe Thr Asn Gin Gin Ala Val 
430 435 440 445 

Glu Val Leu Arg His Thr Gly Gin Thr Val Leu Leu Thr Leu Met Arg 
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450 455 460 

Arg Gly Met Lys Gin Glu Ala Glu Leu Met Ser Arg Glu Asp Val Thr 

465 470 475 

Lys Asp Ala Asp Leu Ser Pro Val Asn Ala Ser He He Lys Glu Asn 

480 485 490 

Tyr Glu Lys Asp Glu Asp Phe Leu Ser Ser Thr Arg Asn Thr Asn He 

495 500 505 

Leu Pro Thr Glu Glu Glu Gly Tyr Pro Leu Leu Ser Ala Glu He Glu 
510 515 520 525 

Glu He Glu Asp Ala Gin Lys Gin Glu Ala Ala Leu Leu Thr Lys Trp 

530 535 540 

Gin Arg He Met Gly He Asn Tyr Glu He Val Val Ala His Val Ser 

545 550 555 

Lys Phe Ser Glu Asn Ser Gly Leu Gly lie Ser Leu Glu Ala Thr Val 

560 565 570 

Gly His His Phe He Arg Ser Val Leu Pro Glu Gly Pro Val Gly His 

575 580 585 

Ser Gly Lys Leu Phe Ser Gly Asp Glu Leu Leu Glu Val Asn Gly He 
590 595 600 605 

Thr Leu Lett 61y Glu Asn His Gin Asp Val Val Asn He Leu Lys Glu 

610 615 620 

Leu Pro He Glu Val Thr Met Val Cys Cys Arg Arg Thr Val Pro Pro 

625 630 635 

Thr Thr Gin Ser Glu Leu Asp Ser Leu Asp Leu Cys Asp He Glu Leu 

640 645 650 

Thr Glu Lys Pro His Val Asp Leu Gly Glu Phe He Gly Ser Ser Glu 
655 660 665 
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Pro Glu Asp Pro Val Leu Ala Met Thr Asp Ala Gly Gin Ser Thr Glu 
670 675 680 685 

Glu Val Gin Ala Pro Leu Ala Met Trp Glu Ala Gly He Gin His ile 

690 695 700 

Met Leu Glu Lys Gly Ser Lys Gly Leu Gly Phe Ser Ile Leu Asp Tyr 

705 710 715 

Gin Asp Pro Ile Asp Pro Ala Ser Thr Val Ile Ile Ile Arg Ser Leu 

720 725 730 

Val Pro Gly Gly Ile Ala Glu Lys Asp Gly Arg Leu Leu Pro Gly Asp 

735 740 745 

Arg Leu Met Phe Val Asn Asp Val Asn Leu Glu Asn Ser Ser Leu Glu 
750 755 760 765 

Glu Ala Val Glu Ala Leu Lys Gly Ala Pro Ser Gly Thr Val Arg Ile 

770 775 780 

Gly Val Ala Lys Pro Leu Pro Leu Ser Pro Glu Glu Gly Tyr Val Ser 

785 790 795 

Ala Lys Glu Asp Ser Phe Leu Tyr Pro Pro His Ser Cys Glu Glu Ala 

800 805 810 

Gly Leu Ala Asp Lys Pro Leu Phe Arg Ala Asp Leu Ala Leu Val Gly 

815 820 825 

Thr Asn Asp Ala Asp Leu Val Asp Glu Ser Thr Phe Glu Ser Pro Tyr 
830 835 840 845 

Ser Pro Glu Asn Asp Ser Ile Tyr Ser Thr Gin Ala Ser Ile Leu Ser 

850 855 860 

Leu His Gly Ser Ser Cys Gly Asp Gly Leu Asn Tyr Gly Ser Ser Leu 

865 870 875 

Pro Ser Ser Pro Pro Lys Asp Val Ile Glu Asn Ser Cys Asp Pro Val 
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880 885 890 

Leu Asp Leu His Met Ser Leu Glu Glu Leu Tyr Thr Gin Asn Leu Leu 

895 900 905 

Glu Arg Gin Asp Glu Asn Thr Pro Ser Val Asp He Ser Met Gly Pro 
910 915 920 925 

Ala Ser Gly Phe Thr He Asn Asp Tyr Thr Pro Ala Asn Ala He Glu 

930 935 940 

Gin Gin Tyr Glu Cys Glu Asn Thr He Val Trp Thr Glu Ser His Leu 

945 950 955 

Pro Ser Glu Val He Ser Ser Ala Glu Leu Pro Ser Val Leu Pro Asp 

960 965 970 

Ser Ala Gly Lys Gly Ser Glu Tyr Leu Leu Glu Gin Ser Ser Leu Ala 

975 980 985 

Cys Asn Ala Glu Cys Val Met Leu Gin Asn Val Ser Lys Glu Ser Phe 
990 995 1000 1005 

Glu Arg Thr He Asn He Ala Lys Gly Asn Ser Ser Leu Gly Met Thr 

1010 1015 1020 

Val Ser Ala Asn Lys Asp Gly Leu Gly Met lie Val Arg Ser He He 

1025 1030 1035 

His Gly Gly Ala He Ser Arg Asp Gly Arg He Ala He Gly Asp Cys 

1040 1045 1050 

He Leu Ser He Asn Glu Glu Ser Thr He Ser Val Thr Asn Ala Gin 

1055 1060 1065 

Ala Arg Ala Met Leu Arg Arg His Ser Leu He Gly Pro Asp He Lys 
1070 1075 1080 1085 

He Thr Tyr Val Pro Ala Glu His Leu Glu Glu Phe Lys He Ser Leu 
1090 1095 1100 
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Gly Gin Gin Ser Gly Arg Val Met Ala Leu Asp He Phe Ser Ser Tyr 

1105 1110 1115 

Thr Gly Arg Asp lie Pro Glu Leu Pro Glu Arg Glu Glu Gly Glu Gly 

1120 1125 1130 

Glu Glu Ser Glu Leu Gin Asn Thr Ala Tyr Ser Asn Trp Asn Gin Pro 

1135 1140 1145 

Arg Arg Val Glu Leu Trp Arg Glu Pro Ser Lys Ser Leu Gly He Ser 
1150 1155 1160 1165 

He Val Gly Gly Arg Gly Met Gly Ser Arg Leu Ser Asn Gly Glu Val 

1170 1175 1180 

Met Arg Gly He Phe He Lys His Val Leu Glu Asp Arg Pro Ala Gly 

1185 1190 1195 

Lys Asn Gly Thr Leu Lys Pro Gly Asp Arg He Val Glu Val Asp Gly 

1200 1205 1210 

Met Asp Leu Arg Asp Ala Ser His Glu Gin Ala Val Glu Ala He Arg 

1215 1220 1225 

Lys Ala Gly Asn Pro Val Val Phe Met Val Gin Ser He He Asn Arg 
1230 1235 1240 1245 

Pro Arg Lys Ser Pro Leu Pro Ser Leu Leu His Asn Leu Tyr Pro Lys 

1250 1255 1260 

Tyr Asn Phe Ser Ser Thr Asn Pro Phe Ala Asp Ser Leu Gin He Asn 

1265 1270 1275 

Ala Asp Lys Ala Pro Ser Gin Ser Glu Ser Glu Pro Glu Lys Ala Pro 

1280 1285 1290 

Leu Cys Ser Val Pro Pro Pro Pro Pro Ser Ala Phe Ala Glu Met Gly 

1295 1300 1305 

Ser Asp His Thr Gin Ser Ser Ala Ser Lys He Ser Gin Asp Val Asp 
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1310 1315 1320 1325 

Lys Glu Asp Glu Phe Gly Tyr Ser Trp Lys Asn He Arg Glu Arg Tyr 

1330 1335 1340 

Gly Thr Leu Thr Gly Glu Leu His Met lie Glu Leu Glu Lys Gly His 

1345 1350 1355 

Ser Gly Leu Gly Leu Ser Leu Ala Gly Asn Lys Asp Arg Ser Arg Met 

1360 1365 1370 

Ser Val Phe He Val Gly He Asp Pro Asn Gly Ala Ala Gly Lys Asp 

1375 1380 1385 

Gly Arg Leu Gin He Ala Asp Glu Leu Leu Glu He Asn Gly Gin He 
1390 1395 1400 1405 

Leu Tyr Gly Arg Ser His Gin Asn Ala Ser Ser He He Lys Cys Ala 

1410 1415 1420 

Pro Ser Lys Val Lys He He Phe He Arg Asn Lys Asp Ala Val Asn 

1425 1430 1435 

Gin Met Ala Val Cys Pro Gly Asn Ala Val Glu Pro Leu Pro Ser Asn 

1440 1445 1450 

Ser Glu Asn Leu Gin Asn Lys Glu Thr Glu Pro Thr Val Thr Thr Ser 

1455 1460 1465 

Asp Ala AU Val Asp Leu Ser Ser Phe Lys Asn Val Gin His Leu Glu 
1470 1475 1480 1485 

Leu Pro Lys Asp Gin Gly Gly Leu Gly He Ala He Ser Glu Glu Asp 

1490 1495 1500 

Thr Leu Ser Gly Val He He Lys Ser Leu Thr Glu His Gly Val Ala 

1505 1510 1515 

Ala Thr Asp Gly Arg Leu Lys Val Gly Asp Gin He Leu Ala Val Asp 
1520 1525 1530 
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Asp Glu He Val Val Gly Tyr Pro He Glu Lys Phe He Ser Leu Leu 

1535 1540 1545 

Lys Thr Ala Lys Met Thr Val Lys Leu Thr lie His Ala Glu Asn Pro 
1550 1555 1560 1565 

Asp Ser Gin Ala Val Pro Ser Ala Ala Gly Ala Ala Ser Gly Glu Lys 

1570 1575 1580 

Lys Asn Ser Ser Gin Ser Leu Met Val Pro Gin Ser Gly Ser Pro Glu 

1585 1590 1595 

Pro Glu Ser He Arg Asn Thr Ser Arg Ser Ser Thr Pro Ala He Phe 

1600 1605 1610 

Ala Ser Asp Pro Ala Thr Cys Pro He He Pro Gly Cys Glu Thr Thr 

1615 1620 1625 

He Glu lie Ser Lys Gly Arg Thr Gly Leu Gly Leu Ser He Val Gly 
1630 1635 1640 1645 

Gly Ser Asp Thr Leu Leu Gly Ala Phe He He His Glu Val Tyr Glu 

1650 1655 1660 

Glu Gly Ala Ala Cys Lys Asp Gly Arg Leu Trp Ala Gly Asp Gin He 

1665 1670 1675 

Leu Glu Val Asn Gly He Asp Leu Arg Lys Ala Thr His Asp Glu Ala 

1680 1685 1690 

He Asn Val Leu Arg Gin Thr Pro Gin Arg Val Arg Leu Thr Leu Tyr 

1695 1700 1705 

Arg Asp Glu Ala Pro Tyr Lys Glu Glu Glu Val Cys Asp Thr Leu Thr 
1710 1715 1720 1725 

He Glu Leu Gin Lys Lys Pro Gly Lys Gly Leu Gly Leu Ser He Val 

1730 1735 1740 

Gly Lys Arg Asn Asp Thr Gly Val Phe Val Ser Asp He Val Lys Gly 
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1745 1750 1755 

Gly He Ala Asp Pro Asp Gly Arg Leu He Gin Gly Asp Gin He Leu 

1760 1765 1770 

Leu Val Asn Gly Glu Asp Val Arg Asn Ala Ser Gin Glu Ala Val Ala 

1775 1780 1785 

Ala Leu Leu Lys Cys Ser Leu Gly Thr Val Thr Leu Glu Val Gly Arg 
1790 1795 1800 1805 

He Lys Ala Gly Pro Phe His Ser Glu Arg Arg Pro Ser Gin Thr Ser 

1810 1815 1820 

Gin Val Ser Glu Gly Ser Leu Ser Ser Phe Thr Phe Pro Leu Ser Gly 

1825 1830 1835 

Ser Ser Thr Ser Glu Ser Leu Glu Ser Ser Ser Lys Lys Asn Ala Leu 

1840 1845 1850 

Ala Ser Glu lie Gin Gly Leu Arg Thr Val Glu Met Lys Lys Gly Pro 

1855 1860 1865 

Thr Asp Ser Leu Gly He Ser He Ala Gly Gly Val Gly Ser Pro Leu 
1870 1875 1880 1885 

Gly Asp Val Pro lie Phe He Ala Met Met His Pro Thr Gly Val Ala 

1890 1895 1900 

Ala Gin Thr Gin Lys Leu Arg Val Gly Asp Arg He Val Thr He Cys 

1905 1910 1915 

Gly Thr Ser Thr Glu Gly Met Thr His Thr Gin Ala Val Asn Leu Leu 

1920 1925 1930 

Lys Asn Ala Ser Gly Ser He Glu Met Gin Val Val Ala Gly Gly Asp 

1935 1940 1945 

Val Ser Val Val Thr Gly His His Gin Glu Pro Ala Ser Ser Ser Leu 
1950 1955 1960 1965 
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Ser Phe Thr Gly Leu Thr Ser Thr Ser He Phe Gin Asp Asp Leu Gly 

1970 1975 1980 

Pro Pro Gin Cys Lys Ser lie Thr Leu Glu Arg Gly Pro Asp Gly Leu 

1985 1990 1995 

Gly Phe Ser He Val Gly Gly Tyr Gly Ser Pro His Gly Asp Leu Pro 

2000 2005 2010 

He Tyr Val Lys Thr Val Phe Ala Lys Gly Ala Ala Ser Glu Asp Gly 

2015 2020 2025 

Arg Leu Lys Arg Gly Asp Gin He He Ala Val Asn Gly Gin Ser Leu 
2030 2035 2040 2045 

Glu Gly Val Thr His Glu Glu Ala Val Ala He Leu Lys Arg Thr Lys 

2050 2055 2060 

Gly Thr Val Thr Leu Met Val Leu Ser 
2065 2070 

<210> 84 
<211> 1239 
<212> PRT 

<213> Hoao sapience 
<400> 84 

Met Leu Glu Ala lie Asp Lys Asn Arg Ala Leu His Ala 
1 5 10 

Ala Glu Arg Leu Gin Thr Lys Leu Arg Glu Arg Gly Asp Val Ala Asn 

15 20 25 

Glu Asp Lys Leu Ser Leu Leu Lys Ser Val Leu Gin Ser Pro Leu Phe 
30 35 40 45 
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Ser Gin He Leu Ser Leu Gin Thr Ser Val Gin Gin Leu Lys Asp Gin 

50 55 60 

Val Asn He Ala Thr Ser Ala Thr Ser Asn He Glu Tyr Ala His Val 

65 70 75 

Pro His Leu Ser Pro Ala Val He Pro Thr Leu Gin Asn Glu Ser Phe 

80 85 90 

Leu Leu Ser Pro Asn Asn Gly Asn Leu Glu Ala Leu Thr Gly Pro Gly 

95 100 105 

He Pro His He Asn Gly Lys Pro Ala Cys Asp Glu Phe Asp Gin Leu 
110 115 120 125 

He Lys Asn Met Ala Gin Gly Arg His Val Glu Val Phe Glu Leu Leu 

130 135 140 

Lys Pro Pro Ser Gly Gly Leu Gly Phe Ser Val Val Gly Leu Arg Ser 

145 150 155 

Glu Asn Arg Gly Glu Leu Gly He Phe Val Gin Glu He Gin Glu Gly 

160 165 170 

Ser Val Ala His Arg Asp Gly Arg Leu Lys Glu Thr Asp Gin He Leu 

175 180 185 

Ala He Asn Gly Gin Ala Leu Asp Gin Thr He Thr His Gin Gin Ala 
190 195 200 205 

He Ser He Leu Gin Lys Ala Lys Asp Thr Val Gin Leu Val He Ala 

210 215 220 

Arg Gly Ser Leu Pro Gin Leu Val Ser Pro He Val Ser Arg Ser Pro 

225 230 235 

Ser Ala Ala Ser Thr He Ser Ala His Ser Asn Pro Val His Trp Gin 

240 245 250 

His Met Glu Thr He Glu Leu Val Asn Asp Gly Ser Gly Leu Gly Phe 
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255 260 265 

Gly He He Gly Gly Lys Ala Thr Gly Val He Val Lys Thr He Leu 
270 275 280 285 

Pro Gly Gly Val Ala Asp Gin His Gly Arg Leu Cys Ser Gly Asp His 

290 295 300 

He Leu Lys He Gly Asp Thr Asp Leu Ala Gly Met Ser Ser Glu Gin 

305 310 315 

Val Ala Gin Val Leu Arg Gin Cys Gly Asn Arg Val Lys Leu Met He 

320 325 330 

Ala Arg Ser Ala He Glu Glu Arg Thr Ala Pro Thr Ala Leu Gly lie 

335 340 345 

Thr Leu Ser Ser Ser Pro Thr Ser Thr Pro Glu Leu Arg Val Asp Ala 
350 355 360 365 

Ser Thr Gin Lys Gly Glu Glu Ser Glu Thr Phe Asp Val Glu Leu Thr 

370 375 380 

Lys Asn Val Gin Gly Leu Gly He Thr He Ala Gly Tyr He Gly Asp 

385 390 395 

Lys Lys Leu Glu Pro Ser Gly He Phe Val Lys Ser He Thr Lys Ser 

400 405 410 

Ser Ala Val Glu His Asp Gly Arg He Gin He Gly Asp Gin He He 

415 420 425 

Ala Val Asp Gly Thr Asn Leu Gin Gly Phe Thr Asn Gin Gin Ala Val 
430 435 440 445 

Glu Val Leu Arg His Thr Gly Gin Thr Val Leu Leu Thr Leu Met Arg 

450 455 460 

Arg Gly Met Lys Gin Glu Ala Glu Leu Met Ser Arg Glu Asp Val Thr 
465 470 475 
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Lys Asp Ala Asp Leu Ser Pro Val Asn Ala Ser lie He Lys Glu Asn 

480 485 490 

Tyr Glu Lys Asp Glu Asp Phe Leu Ser Ser Thr Arg Asn Thr Asn He 

495 500 505 

Leu Pro Thr Glu Glu Glu Gly Tyr Pro Leu Leu Ser Ala Glu He Glu 
510 515 520 525 

Glu He Glu Asp Ala Gin Lys Gin Glu Ala Ala Leu Leu Thr Lys Trp 

530 535 540 

Gin Arg He Met Gly He Asn Tyr Glu He Val Val Ala His Val Ser 

545 550 555 

Lys Phe Ser Glu Asn Ser Gly Leu Gly He Ser Leu Glu Ala Thr Val 

560 565 570 

Gly His His Phe He Arg Ser Val Leu Pro Glu Gly Pro Val Gly His 

575 580 585 

Ser Gly Lys Leu Phe Ser Gly Asp Glu Leu Leu Glu Val Asn Gly He 
590 595 600 605 

Thr Leu Leu Gly Glu Asn His Gin Asp Val Val Asn He Leu Lys Glu 

610 615 620 

Leu Pro He Glu Val Thr Met Val Cys Cys Arg Arg Thr Val Pro Pro 

625 630 635 

Thr Thr Gift Ser Glu Leu Asp Ser Leu Asp Leu Cys Asp lie Glu Leu 

640 645 650 

Thr Glu Lys Pro His Val Asp Leu Gly Glu Phe He Gly Ser Ser Glu 

655 660 665 

Pro Glu Asp Pro Val Leu Ala Met Thr Asp Ala Gly Gin Ser Thr Glu 
670 675 680 685 

Glu Val Gin Ala Pro Leu Ala Met Trp Glu Ala Gly He Gin His He 



108/166 



690 695 700 

Met Leu Glu Lys Gly Ser Lys Gly Leu Gly Phe Ser lie Leu Asp Tyr 

705 710 715 

Gin Asp Pro lie Asp Pro Ala Ser Thr Val He lie He Arg Ser Leu 

720 725 730 

Val Pro Gly Gly He Ala Glu Lys Asp Gly Arg Leu Leu Pro Gly Asp 

735 740 745 

Arg Leu Met Phe Val Asn Asp Val Asn Leu Glu Asn Ser Ser Leu Glu 
750 755 760 765 

Glu Ala Val Glu Ala Leu Lys Gly Ala Pro Ser Gly Thr Val Arg He 

770 775 780 

Gly Val Ala Lys Pro Leu Pro Leu Ser Pro Glu Glu Gly Tyr Val Ser 

785 790 795 

Ala Lys Glu Asp Ser Phe Leu Tyr Pro Pro His Ser Cys Glu Glu Ala 

800 805 810 

Gly Leu Ala Asp Lys Pro Leu Phe Arg Ala Asp Leu Ala Leu Val Gly 

815 820 825 

Thr Asn Asp Ala Asp Leu Val Asp Glu Ser Thr Phe Glu Ser Pro Tyr 
830 835 840 845 

Ser Pro Glu Asa Asp Ser lie Tyr Ser Thr Gin Ala Ser He Leu Ser 

850 855 860 

Leu His Gly Ser Ser Cys Gly Asp Gly Leu Asn Tyr Gly Ser Ser Leu 

865 870 875 

Pro Ser Ser Pro Pro Lys Asp Val He Glu Asn Ser Cys Asp Pro Val 

880 885 890 

Leu Asp Leu His Met Ser Leu Glu Glu Leu Tyr Thr Gin Asn Leu Leu 
895 900 905 



109/166 



Glu Arg Gin Asp Glu Asn Thr Pro Ser Val Asp He Ser Met Gly Pro 
910 915 920 925 

Ala Ser Gly Phe Thr He Asn Asp Tyr Thr Pro Ala Asn Ala He Glu 

930 935 940 

Gin Gin Tyr Glu Cys Glu Asn Thr He Val Trp Thr Glu Ser His Leu 

945 950 955 

Pro Ser Glu Val He Ser Ser Ala Glu Leu Pro Ser Val Leu Pro Asp 

960 965 970 

Ser Ala Gly Lys Gly Ser Glu His Leu Leu Glu Gin Ser Ser Leu Ala 

975 980 985 

Cys Asn Ala Glu Cys Val Met Leu Gin Asn Val Ser Lys Glu Ser Phe 
990 995 1000 1005 

Glu Arg Thr He Asn He Ala Lys Gly Asn Ser Ser Leu Gly Met Thr 

1010 1015 1020 

Val Ser Ala Asn Lys Asp Gly Leu Gly Met He Val Arg Ser He He 

1025 1030 1035 

His Gly Gly Ala lie Ser Arg Asp Gly Arg He Ala He Gly Asp Cys 

1040 1045 1050 

He Leu Ser He Asn Glu Glu Ser Thr He Ser Val Thr Asn Ala Gin 

1055 1060 1065 

Ala Arg Ala Met Leu Arg Arg His Ser Leu He Gly Pro Asp He Lys 
1070 1075 1080 1085 

He Thr Tyr Val Pro Ala Glu His Leu Glu Glu Phe Lys He Ser Leu 

1090 1095 1100 

Gly Gin Gin Ser Gly Arg Val Met Ala Leu Asp lie Phe Ser Ser Tyr 

1105 1110 1115 

Thr Gly Arg Asp He Pro Glu Leu Pro Glu Arg Glu Glu Gly Glu Gly 
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1120 1125 1130 

Glu Glu Ser Glu Leu Gin Asn Thr Ala Tyr Ser Asn Trp Asn Gin Pro 

1135 1140 1145 

Arg Arg Val Glu Leu Trp Arg Glu Pro Ser Lys Ser Leu Gly He Ser 
1150 1155 1160 1165 

He Val Gly Gly Arg Gly Met Gly Ser Arg Leu Ser Asn Gly Glu Val 

1170 1175 1180 

Met Arg Gly He Phe He Lys His Val Leu Glu Asp Ser Pro Ala Gly 

1185 1190 1195 

Lys Asn Gly Thr Leu Lys Pro Gly Asp Arg He Val Glu Val Asp Gly 

1200 1205 1210 

Met Asp Leu Arg Asp Ala Ser His Glu Gin Ala Val Glu Ala He Arg 

1215 1220 1225 

Lys Ala Gly Asn Pro Val Val Phe Met Val 
1230 1235 

<210> 85 

<211> 6540 

<212> DNA 

< 2 1 3 > Homo sapience 

<220> 
<221> CDS 

<222> (71). ..(6070) 
<400> 85 

ttctcagtca cgcagttcca ttttaattgc tgttaatcat ttcagagaag aacactgaac 
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tttgaaaaaa atg ttg gaa gcc att gac aaa aat egg gec ctg cat gca 109 
Met Leu Glu Ala lie Asp Lys Asn Arg Ala Leu His Ala 
1 5 10 

gca gag cgc ttg caa acc aag ctg cga gaa cgt ggg gat gta gca aat 157 
Ala Glu Arg Leu Gin Thr Lys Leu Arg Glu Arg Gly Asp Val Ala Asn 
15 20 25 

gaa gac aaa ctg age ctt ctg aag tea gtc ctg cag age cct etc ttc 205 
Glu Asp Lys Leu Ser Leu Leu Lys Ser Val Leu Gin Ser Pro Leu Phe 
30 35 40 45 

agt cag att ctg age ctt cag act tct gta cag cag ctg aaa gac cag 253 
Ser Gin lie Leu Ser Leu Gin Thr Ser Val Gin Gin Leu Lys Asp Gin 
50 55 60 

gta aat att gca act tea gca act tea aat att gaa tat gcc cac gtt 301 
Val Asn He Ala Thr Ser Ala Thr Ser Asn lie Glu Tyr Ala His Val 
65 70 75 

cct cat etc age cca get gtg att cct act ctg caa aat gaa teg ttt 349 
Pro His Leu Ser Pro Ala Val He Pro Thr Leu Gin Asn Glu Ser Phe 
80 85 90 

tta tta tec cca aac aat ggg aat ctg gaa gca ctt aca gga cct ggt 397 
Leu Leu Ser Pro Asn Asn Gly Asn Leu Glu Ala Leu Thr Gly Pro Gly 
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95 100 105 

att cca cac att aat ggg aaa cct get tgt gat gaa ttt gat cag ctt 445 
He Pro His He Asn Gly Lys Pro Ala Cys Asp Glu Phe Asp Gin Leu 
110 115 120 125 

ate aaa aat atg gec cag ggt cgc cat gta gaa gtt ttt gag etc etc 493 
He Lys Asn Met Ala Gin Gly Arg His Val Glu Val Phe Glu Leu Leu 
130 135 140 

aaa cct cca tct gga ggc ctt ggg ttt agt gtt gtg gga eta aga agt 541 
Lys Pro Pro Ser Gly Gly Leu Gly Phe Ser Val Val Gly Leu Arg Ser 
145 150 155 

gaa aac aga gga gag ctg gga ata ttt gtt caa gag ata caa gag ggc 589 
Glu Asn Arg Gly Glu Leu Gly He Phe Val Gin Glu He Gin Glu Gly 
160 165 170 

agt gtg gec cat aga gat gga aga ttg aaa gaa act gat caa att ctt 637 
Ser Val Ala His Arg Asp Gly Arg Leu Lys Glu Thr Asp Gin He Leu 
175 180 185 

get ate aat gga cag get ctt gat cag aca att aca cat cag cag get 685 
Ala He Asn Gly Gin Ala Leu Asp Gin Thr He Thr His Gin Gin Ala 
190 195 200 205 

ate age ate ctg cag aaa gec aaa gat act gtc cag eta gtt att gec 733 
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He Ser He Leu Gin Lys Ala Lys Asp Thr Val Gin Leu Val He Ala 
210 215 220 

aga ggc tea ttg cct cag ctt gtc age ccc ata gtt tec cgt tct cca 781 

Arg Gly Ser Leu Pro Gin Leu Val Ser Pro He Val Ser Arg Ser Pro 
225 230 235 

tct gca gec age aca att tea get cac tct aat ccg gtt cac tgg caa 829 

Ser Ala Ala Ser Thr lie Ser Ala His Ser Asn Pro Val His Trp Gin 

240 245 250 

cac atg gaa acg att gaa ttg gtg aat gat gga tct ggt ttg gga ttt 877 

His Met Glu Thr lie Glu Leu Val Asn Asp Gly Ser Gly Leu Gly Phe 
255 260 265 

ggc ate ata gga gga aaa gca act ggt gtg ata gta aaa acc att ctg 925 

Gly lie He Gly Gly Lys Ala Thr Gly Val lie Val Lys Thr He Leu 
270 275 280 285 

cct gga gga gta get gat cag cat ggg cgt tta tgc agt gga gac cac 973 

Pro Gly Gly Val Ala Asp Gin His Gly Arg Leu Cys Ser Gly Asp His 
290 295 300 

att eta aag att ggt gac aca gat eta gca gga atg age agt gag caa 1021 

lie Leu Lys He Gly Asp Thr Asp Leu Ala Gly Met Ser Ser Glu Gin 
305 310 315 
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gta gca caa gtc ctt agg caa tgt gga aat aga gtt aag ttg atg att 1069 

Val Ala Gin Val Leu Arg Gin Cys Gly Asn Arg Val Lys Leu Met He 

320 325 330 

gca aga agt gcc ata gaa gaa cgt aca gca ccc act get ttg ggc ate 1117 

Ala Arg Ser Ala He Glu Glu Arg Thr Ala Pro Thr Ala Leu Gly lie 

335 340 345 

acc etc tec tea tec cca act tea acg cca gag ttg egg gtt gat get 1165 

Thr Leu Ser Ser Ser Pro Thr Ser Thr Pro Glu Leu Arg Val Asp Ala 

350 355 360 365 

tct act cag aaa ggt gaa gaa agt gag aca ttt gat gta gaa etc act 1213 

Ser Thr Gin Lys Gly Glu Glu Ser Glu Thr Phe Asp Val Glu Leu Thr 

370 375 380 

aaa aat gtc caa gga tta gga att acc att get ggc tac att gga gat 1261 

Lys Asn Val Gin Gly Leu Gly He Thr He Ala Gly Tyr He Gly Asp 

385 390 395 

aaa aaa ttg gaa cct tea gga ate ttt gta aag age att aca aaa age 1309 

Lys Lys Leu Glu Pro Ser Gly He Phe Val Lys Ser He Thr Lys Ser 

400 405 410 

agt gcc gtt gag cat gat gga aga ate caa att gga gac caa att ata 1357 

Ser Ala Val Glu His Asp Gly Arg He Gin He Gly Asp Gin He He 

415 420 425 
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gca gta gat ggc aca aac ctt cag ggt 
Ala Val Asp Gly Thr Asn Leu Gin Gly 
430 435 

gag gta ttg cga cat aca gga caa act 

Glu Val Leu Arg His Thr Gly Gin Thr 
450 

aga gga atg aag cag gaa gcc gag etc 

Arg Gly Met Lys Gin Glu Ala Glu Leu 

465 470 

aaa gat gca gat ttg tct cct gtt aat 
Lys Asp Ala Asp Leu Ser Pro Val Asn 
480 485 

tat gaa aaa gat gaa gat ttt tta tct 
Tyr Glu Lys Asp Glu Asp Phe Leu Ser 
495 500 

tta cca act gaa gaa gaa ggg tat cca 
Leu Pro Thr Glu Glu Glu Gly Tyr Pro 
510 515 

gaa ata gaa gat gca caa aaa caa gaa 
Glu He Glu Asp Ala Gin Lys Gin Glu 



ttt act aat cag caa gca gta 1405 
Phe Thr Asn Gin Gin Ala Val 
440 445 

gtg etc ctg aca eta atg agg 1453 
Val Leu Leu Thr Leu Met Arg 
455 460 

atg tea agg gaa gac gtc aca 1501 
Met Ser Arg Glu Asp Val Thr 
475 

gcc age ata ate aaa gaa aat 1549 
Ala Ser He He Lys Glu Asn 
490 

teg acg aga aac ace aac ata 1597 
Ser Thr Arg Asn Thr Asn He 
505 

tta ctg tea get gag ata gaa 1645 
Leu Leu Ser Ala Glu He Glu 
520 525 

get get ctg ctg aca aaa tgg 1693 
Ala Ala Leu Leu Thr Lys Trp 
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530 535 540 

caa agg att atg gga att aac tat gaa ata gtg gtg gcc cat gtg age 1741 
Gin Arg lie Met Gly lie Asn Tyr Glu He Val Val Ala His Val Ser 
545 550 555 

aag ttt agt gag aac agt gga ttg ggg ata age ctg gaa gcg aca gtg 1789 
Lys Phe Ser Glu Asn Ser Gly Leu Gly He Ser Leu Glu Ala Thr Val 
560 565 570 

gga cat cat ttt ate cga tct gtt eta cca gag ggt cct gtt gga cac 1837 
Gly His His Phe He Arg Ser Val Leu Pro Glu Gly Pro Val Gly His 
575 580 585 

age ggg aag etc ttc agt gga gac gag eta ttg gaa gta aat ggc ata 1885 
Ser Gly Lys Leu Phe Ser Gly Asp Glu Leu Leu Glu Val Asn Gly He 
590 595 600 605 

act tta ctt ggg gaa aat cac caa gat gtg gtg aat ate tta aaa gaa 1933 
Thr Leu Leu, Gly Glu Asn His Gin Asp Val Val Asn He Leu Lys Glu 
610 615 620 

ctg cct ata gaa gtg aca atg gtg tgc tgt cgt cga act gtg cca ccc 1981 
Leu Pro He Glu Val Thr Met Val Cys Cys Arg Arg Thr Val Pro Pro 
625 630 635 



acc acc caa tea gaa ttg gat age ctg gac tta tgt gat att gag eta 



2029 
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Thr Thr Gin Ser Glu Leu Asp Ser Leu Asp Leu Cys Asp He Glu Leu 
640 645 650 

aca gaa aag cct cac gta gat eta ggt gag ttc ate ggg tea tea gag 2077 
Thr Glu Lys Pro His Val Asp Leu Gly Glu Phe He Gly Ser Ser Glu 
655 660 665 

cca gag gat cca gtg ctg gcg atg act gat gcg ggt cag agt aca gaa 2125 
Pro Glu Asp Pro Val Leu Ala Met Thr Asp Ala Gly Gin Ser Thr Glu 
670 675 680 685 

gag gtt caa gca cct ttg gec atg tgg gag get ggc att cag cac ata 2173 
Glu Val Gin Ala Pro Leu Ala Met Trp Glu Ala Gly lie Gin His He 
690 695 700 

atg ctg gag aaa ggg age aaa gga ctt ggt ttt age att tta gat tat 2221 
Met Leu Glu Lys Gly Ser Lys Gly Leu Gly Phe Ser He Leu Asp Tyr 
705 710 715 

cag gat cca**tt gat cca gca age act gtg att ata att cgt tct ttg 2269 
Gin Asp Pro He Asp Pro Ala Ser Thr Val He He He Arg Ser Leu 
720 725 730 

gtg cct ggc ggc att get gaa aag gat gga cga ctt ctt cct ggt gac 2317 
Val Pro Gly Gly He Ala Glu Lys Asp Gly Arg Leu Leu Pro Gly Asp 
735 740 745 
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cga etc atg ttt gta aac gat gtt aac ttg gaa aac age agt ctt gag 2365 

Arg Leu Met Phe Val Asn Asp Val Asn Leu Glu Asn Ser Ser Leu Glu 

750 755 760 765 

gaa get gta gaa gca ctg aag gga gca ccg tea ggg act gtg aga ata 2413 

Glu Ala Val Glu Ala Leu Lys Gly Ala Pro Ser Gly Thr Val Arg He 
770 775 780 

gga gtt get aag cct tta ccc ctt tea cca gaa gaa ggt tat gtt tct 2461 

Gly Val Ala Lys Pro Leu Pro Leu Ser Pro Glu Glu Gly Tyr Val Ser 
785 790 795 

get aag gag gat tec ttt etc tac cca cca cac tec tgt gag gaa gca 2509 

Ala Lys Glu Asp Ser Phe Leu Tyr Pro Pro His Ser Cys Glu Glu Ala 
800 805 810 

ggg ctg get gac aaa ccc etc ttc agg get gac ttg get ctg gtg ggc 2557 

Gly Leu Ala Asp Lys Pro Leu Phe Arg Ala Asp Leu Ala Leu Val Gly 
815 820 825 

aca aat gat get gac tta gta gat gaa tec aca ttt gag tct cca tac 2605 

Thr Asn Asp Ala Asp Leu Val Asp Glu Ser Thr Phe Glu Ser Pro Tyr 

830 835 840 845 

tct cct gaa aat gac age ate tac tct act caa gee tct att tta tct 2653 

Ser Pro Glu Asn Asp Ser He Tyr Ser Thr Gin Ala Ser He Leu Ser 
850 855 860 
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ctt cat ggc agt tct tgt ggt gat ggc ctg aac tat ggt tct tec ctt 
Leu His Gly Ser Ser Cys Gly Asp Gly Leu Asn Tyr Gly Ser Ser Leu 
865 870 875 



2701 



cca tea tct cct cct aag gat gtt att gaa aat tct tgt gat cca gta 
Pro Ser Ser Pro Pro Lys Asp Val He Glu Asn Ser Cys Asp Pro Val 
880 885 890 



2749 



ctt gat ctg cat atg tct ctg gag gaa eta tat acc cag aat etc ctg 
Leu Asp Leu His Met Ser Leu Glu Glu Leu Tyr Thr frln Asn Leu Leu 
895 900 905 



2797 



gaa aga cag gat gag aat aca cct teg gtg gac ata agt atg ggg cct 
Glu Arg Gin Asp Glu Asn Thr Pro Ser Val Asp He Ser Met Gly Pro 
910 915 920 925 



2845 



get tct ggc ttt act ata aat gac tac aca cct gca aat get att gaa 
Ala Ser Gly Phe Thr He Asn Asp Tyr Thr Pro Ala Asn Ala He Glu 
930 935 940 



2893 



caa caa tat gaa tgt gaa aac aca ata gtg tgg act gaa tct cat tta 
Gin Gin Tyr Glu Cys Glu Asn Thr He Val Trp Thr Glu Ser His Leu 
945 950 955 



2941 



cca agt gaa gtt ata tea agt gca gaa ctt cct tct gtg eta ccc gat 
Pro Ser Glu Val He Ser Ser Ala Glu Leu Pro Ser Val Leu Pro Asp 



2989 
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960 965 970 

tea get gga aag ggc tct gag cac ctg ctt gaa cag age tec ctg gec 3037 
Ser Ala Gly Lys Gly Ser Glu His Leu Leu Glu Gin Ser Ser Leu Ala 
975 980 985 

tgt aat get gag tgt gtc atg ctt caa aat gta tct aaa gaa tct ttt 3085 
Cys Asn Ala Glu Cys Val Met Leu Gin Asn Val Ser Lys Glu Ser Phe 
990 995 1000 1005 

gaa agg act att aat ata gca aaa ggc aat tct age eta gga atg aca 3133 
Glu Arg Thr He Asn lie Ala Lys Gly Asn Ser Ser Leu Gly Met Thr 
1010 1015 1020 

gtt agt get aat aaa gat ggc ttg ggg atg ate gtt cga age att att 3181 
Val Ser Ala Asn Lys Asp Gly Leu Gly Met He Val Arg Ser He He 
1025 1030 1035 

cat gga ggt gec att agt cga gat ggc egg att gee att ggg gac tgc 3229 
His Gly Gly Ala lie Ser Arg Asp Gly Arg lie Ala He Gly Asp Cys 
1040 1045 1050 

ate ttg tec att aat gaa gag tct ace ate agt gta ace aat gee cag 3277 
He Leu Ser lie Asn Glu Glu Ser Thr He Ser Val Thr Asn Ala Gin 
1055 1060 1065 

gca cga get atg ttg aga aga cat tct etc att ggc cct gac ata aaa 3325 
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Ala Arg Ala Met Leu Arg Arg His Ser Leu He Gly Pro Asp He Lys 
1070 1075 1080 1085 



att act tat gtg cct gca gaa cat ttg gaa gag ttc aaa ata age ttg 
He Thr Tyr Val Pro Ala Glu His Leu Glu Glu Phe Lys He Ser Leu 
1090 1095 1100 



3373 



gga caa caa tct gga aga gta atg gca ctg gat att ttt tct tea tac 
Gly Gin Gin Ser Gly Arg Val Met Ala Leu Asp He Phe Ser Ser Tyr 
1105 1110 1115 



3421 



act ggc aga gac att cca gaa tta cca gag cga gaa gag gga gag ggt 
Thr Gly Arg Asp lie Pro Glu Leu Pro Glu Arg Glu Glu Gly Glu Gly 
1120 1125 1130 



3469 



gaa gaa age gaa ctt caa aac aca gca tat age aat tgg aat cag ccc 
Glu Glu Ser Glu Leu Gin Asn Thr Ala Tyr Ser Asn Trp Asn Gin Pro 
1135 1140 1145 



3517 



agg egg gtg gaa etc tgg aga gaa cca age aaa tec tta ggc ate age 
Arg Arg Val Glu Leu Trp Arg Glu Pro Ser Lys Ser Leu Gly He Ser 
1150 1155 1160 1165 



3565 



att gtt ggt gga cga ggg atg ggg 
He Val Gly Gly Arg Gly Met Gly 
1170 



agt egg eta age aat 
Ser Arg Leu Ser Asn 
1175 



gga gaa gtg 
Gly Glu Val 
1180 



3613 
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atg agg ggc att ttc ate aaa cat gtt ctg gaa gat agt cca get ggc 3661 
Met Arg Gly He Phe lie Lys His Val Leu Glu Asp Ser Pro Ala Gly 
1185 1190 1195 

aaa aat gga acc ttg aaa cct gga gat aga ate gta gag gca ccc agt 3709 
Lys Asn Gly Thr Leu Lys Pro Gly Asp Arg lie Val Glu Ala Pro Ser 
1200 1205 1210 

cag tea gag tea gag cca gag aag get cca ttg tgc agt gtg ccc cca 3757 
Gin Ser Glu Ser Glu Pro Glu Lys Ala Pro Leu Cys Ser Val Pro Pro 
1215 1220 1225 

ccc cct cct tea gee ttt gec gaa atg ggt agt gat cac aca cag tea 3805 
Pro Pro Pro Ser Ala Phe Ala Glu Met Gly Ser Asp His Thr Gin Ser 
1230 1235 1240 1245 

tct gca age aaa ate tea caa gat gtg gac aaa gag gat gag ttt ggt 3853 
Ser Ala Ser Lys He Ser Gin Asp Val Asp Lys Glu Asp Glu Phe Gly 
1250 1255 1260 

- n< 

tac age tgg aaa aat ate aga gag cgt tat gga acc eta aca ggc gag 3901 
Tyr Ser Trp Lys Asn lie Arg Glu Arg Tyr Gly Thr Leu Thr Gly Glu 
1265 1270 1275 



ctg cat 
Leu His 



atg att gaa ctg gag aaa ggt cat agt ggt ttg ggc eta agt 
Met He Glu Leu Glu Lys Gly His Ser Gly Leu Gly Leu Ser 
1280 1285 1290 



3949 



123/166 



ctt get ggg aac aaa gac cga tec agg atg agt gtc ttc ata gtg ggg 3997 
Leu Ala Gly Asn Lys Asp Arg Ser Arg Met Ser Val Phe lie Val Gly 
1295 1300 1305 

att gat cca aat gga get gca gga aaa gat ggt cga ttg caa att gca 4045 
He Asp Pro Asn Gly Ala Ala Gly Lys Asp Gly Arg Leu Gin He Ala 
1310 1315 1320 1325 

gat gag ctt eta gag ate aat ggt cag att tta tat gga aga agt cat 4093 
Asp Glu Leu Leu Glu He Asn Gly Gin He Leu Tyr Gly Arg Ser His 
1330 1335 1340 

cag aat gee tea tea ate att aaa tgt gee cct tct aaa gtg aaa ata 4141 
Gin Asn Ala Ser Ser He He Lys Cys Ala Pro Ser Lys Val Lys He 
1345 1350 1355 

att ttt ate aga aat aaa gat gca gtg aat cag atg gee gta tgt cct 4189 
lie Phe lie Arg Asn Lys Asp Ala Val Asn Gin Met Ala Val Cys Pro 
1380 1365 1370 

gga aat gca gta gaa cct ttg cct tct aac tea gaa aat ctt caa aat 4237 
Gly Asn Ala Val Glu Pro Leu Pro Ser Asn Ser Glu Asn Leu Gin Asn 
1375 1380 1385 

aag gag aca gag cca act gtt act act tct gat gca get gtg gac etc 4285 
Lys Glu Thr Glu Pro Thr Val Thr Thr Ser Asp Ala Ala Val Asp Leu 
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1390 1395 1400 1405 

agt tea ttt aaa aat gtg caa cat ctg gag ctt ccc aag gat cag ggg 4333 
Ser Ser Phe Lys Asn Val Gin His Leu Glu Leu Pro Lys Asp Gin Gly 
1410 1415 1420 

ggt ttg ggt att get ate age gaa gaa gat aca etc agt gga gtc ate 4381 
Gly Leu Gly He Ala lie Ser Glu Glu Asp Thr Leu Ser Gly Val He 
1425 1430 1435 

ata aag age tta aca gag cat ggg gta gca gee acg gat gga cga etc 4429 
He Lys Ser Leu Thr Glu His Gly Val Ala Ala Thr Asp Gly Arg Leu 
1440 1445 1450 

aaa gtc gga gat cag ata ctg get gta gat gat gaa att gtt gtt ggt 4477 
Lys Val Gly Asp Gin He Leu Ala Val Asp Asp Glu He Val Val Gly 
1455 1460 1465 

tac cct att gaa aag ttt att age ctt ctg aag aca gca aag atg aca 4525 
Tyr Pro He Glu Lys Phe He Ser Leu Leu Lys Thr Ala Lys Met Thr 
1470 1475 1480 1485 

gta aaa ctt ace ate cat get gag aat cca gat tec cag get gtt cct 4573 
Val Lys Leu Thr lie His Ala Glu Asn Pro Asp Ser Gin Ala Val Pro 
1490 1495 1500 



tea gca get ggt gca gee agt gga gaa aaa aag aac age tec cag tct 
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Ser Ala Ala Gly Ala Ala Ser Gly Glu Lys Lys Asn Ser Ser Gin Ser 
1505 1510 1515 

ctg atg gtc cca cag tct ggc tec cca gaa ccg gag tec ate cga aat 4669 
Leu Met Val Pro Gin Ser Gly Ser Pro Glu Pro Glu Ser He Arg Asn 
1520 1525 1530 

aca age aga tea tea aca cca gca att ttt get tct gat cct gca ace 4717 
Thr Ser Arg Ser Ser Thr Pro Ala lie Phe Ala Ser Asp Pro Ala Thr 
1535 1540 1545 

tgc ccc att ate cct ggc tgc gaa aca acc ate gag att tec aaa ggg 4765 
Cys Pro lie He Pro Gly Cys Glu Thr Thr He Glu He Ser Lys Gly 
1550 1555 1560 1565 

cga aca ggg ctg ggc ctg age ate gtt ggg ggt tea gac acg ctg ctg 4813 
Arg Thr Gly Leu Gly Leu Ser He Val Gly Gly Ser Asp Thr Leu Leu 
1570 1575 1580 

ggt gee ttt att ate cat gaa gtt tat gaa gaa gga gca gca tgt aaa 4861 
Gly Ala Phe He He His Glu Val Tyr Glu Glu Gly Ala Ala Cys Lys 
1585 1590 1595 



gat gga 
Asp Gly 



aga etc tgg get gga gat cag ate tta gag gtg aat gga att 
Arg Leu Trp Ala Gly Asp Gin He Leu Glu Val Asn Gly He 
1600 1605 1610 



4909 



fl 
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gac ttg agg aag gcc aca cat gat gaa gca ate aat gtc ctg aga cag 
Asp Leu Arg Lys Ala Thr His Asp Glu Ala He Asn Val Leu Arg Gin 
1615 1620 1625 



4957 



acg cca cag aga gtg cgc ctg aca etc tac aga gat gag gcc cca tac 
Thr Pro Gin Arg Val Arg Leu Thr Leu Tyr Arg Asp Glu Ala Pro Tyr 
1630 1635 1640 1645 



5005 



aaa gag gag gaa gtg tgt gac acc etc act att gag ctg cag aag aag 
Lys Glu Glu Glu Val Cys Asp Thr Leu Thr He Glu Leu Gin Lys Lys 
1650 1655 1660 



5053 



ccg gga aaa ggc eta gga tta agt att gtt ggt aaa aga aac gat act 
Pro Gly Lys Gly Leu Gly Leu Ser He Val Gly Lys Arg Asn Asp Thr 
1665 1670 1675 



5101 



gga gta ttt gtg tea gac att gtc aaa gga gga att gca gat ccc gat 
Gly Val Phe Val Ser Asp He Val Lys Gly Gly He Ala Asp Pro Asp 
1680 1685 1690 * 



5149 



gga aga ctg ate cag gga gac cag ata tta ttg gtg aat ggg gaa gac 
Gly Arg Leu He Gin Gly Asp Gin He Leu Leu Val Asn Gly Glu Asp 
1695 1700 1705 



5197 



gtt cgt aat gcc tec 
Val Arg Asn Ala Ser 
1710 



caa gaa gcg gtt gcc get ttg eta aag 
Gin Glu Ala Val Ala Ala Leu Leu Lys 
1715 1720 



tgt tec 
Cys Ser 
1725 



5245 
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eta ggc aca gta acc ttg gaa gtt gga aga ate aaa get ggt cca ttc 5293 
Leu Gly Thr Val Thr Leu Glu Val Gly Arg He Lys Ala Gly Pro Phe 
1730 1735 1740 

cat tea gag agg agg cca tct caa acc age cag gtg agt gaa ggc age 5341 
His Ser Glu Arg Arg Pro Ser Gin Thr Ser Gin Val Ser Glu Gly Ser 
1745 1750 1755 

ctg tct tct ttc act ttt cca etc tct gga tec agt aca tct gag tea 5389 
Leu Ser Ser Phe Thr Phe Pro Leu Ser Gly Ser Ser Thr Ser Glu Ser 
1760 1765 1770 

ctg gaa agt age tea aag aag aat gca ttg gca tct gaa ata cag gga 5437 
Leu Glu Ser Ser Ser Lys Lys Asn Ala Leu Ala Ser Glu He Gin Gly 
1775 1780 1785 

tta aga aca gtc gaa atg aaa aag ggc cct act gac tea ctg gga ate 5485 
Leu Arg Thr Val Glu Met Lys Lys Gly Pro Thr Asp Ser Leu Gly He 
1790 * 1795 1800 1805 

age ate get gga gga gta ggc age cca ctt ggt gat gtg cct ata ttt 5533 
Ser lie Ala Gly Gly Val Gly Ser Pro Leu Gly Asp Val Pro He Phe 
1810 1815 1820 

att gca atg atg cac cca act gga gtt gca gca cag acc caa aaa etc 5581 
He Ala Met Met His Pro Thr Gly Val Ala Ala Gin Thr Gin Lys Leu 
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1825 1830 1835 

aga gtt ggg gat agg att gtc acc ate tgt ggc aca tec act gag ggc 5629 
Arg Val Gly Asp Arg He Val Thr He Cys Gly Thr Ser Thr Glu Gly 
1840 1845 1850 

atg act cac acc caa gca gtt aac eta ctg aaa aat gca tct ggc tec 5677 
Met Thr His Thr Gin Ala Val Asn Leu Leu Lys Asn Ala Ser Gly Ser 
1855 I860 1865 

att gaa atg cag gtg gtt get gga gga gac gtg agt gtg gtc aca ggt 5725 
He Glu Met Gin Val Val Ala Gly Gly Asp Val Ser Val Val Thr Gly 
1870 1875 1880 1885 

cat cat cag gag cct gca agt tec agt ctt tct ttc act ggg ctg acg 5773 
His His Gin Glu Pro Ala Ser Ser Ser Leu Ser Phe Thr Gly Leu Thr 
1890 1895 1900 

tea acc agt ata ttt cag gat gat tta gga cct cct caa tgt aag tct 5821 
Ser Thr Ser He Phe Gin Asp Asp Leu Gly Pro Pro Gin Cys Lys Ser 
1905 1910 1915 

att aca eta gag cga gga cca gat ggc tta ggc ttc agt ata gtt gga 5869 
He Thr Leu Glu Arg Gly Pro Asp Gly Leu Gly Phe Ser He Val Gly 
1920 1925 1930 



gga tat ggc age cct cat gga gac tta ccc att tat gtt aaa aca gtg 



5917 
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Gly Tyr Gly Ser Pro His Gly Asp Leu Pro He Tyr Val Lys Thr Val 
1935 1940 1945 

ttt gca aag gga gca gcc tct gaa gac gga cgt ctg aaa agg ggc gat 5965 
Phe Ala Lys Gly Ala Ala Ser Glu Asp Gly Arg Leu Lys Arg Gly Asp 
1950 1955 1960 1965 

cag ate att get gtc aat ggg cag agt eta gaa gga gtc acc cat gaa 6013 
Gin He He Ala Val Asn Gly Gin Ser Leu Glu Gly Val Thr His Glu 
1970 1975 1980 

gaa get gtt gcc ate ctt aaa egg aca aaa ggc act gtc act ttg atg 6061 
Glu Ala Val Ala He Leu Lys Arg Thr Lys Gly Thr Val Thr Leu Met 
1985 1990 1995 

gtt etc tct tgaattggct gecagaattg aaccaaccca acccctagct 6110 
Val Leu Ser 
2000 

cacctcctac tg:taaagaga atgcactggt cctgacaatt tttatgctgt gttcagcegg 6170 

gtcttcaaaa ctgtaggggg gaaataacac ttaagtttct ttttctcatc tagaaatget 6230 

ttccttactg acaacctaac atcatttttc ttttcttctt gcattttgtg aacttaaaga 6290 

gaaggaatat ttgtgtaggt gaatctcgtt tttatttgtg gagatatcta atgttttgta 6350 



130/166 



gtcacatggg caagaattat tacatgctaa gctggttagt ataaagaaag ataattctaa 



6410 



agctaaccaa agaaaatggc ttcagtaagt taggatgaaa aatgaaaata taaaataaag 



6470 



aagaaaatct cggggagttt aaaaaaaatg cctcaatttg gcaatctacc tcctctcccc 



6530 



accccaaact 



6540 



<210> 86 

<211> 6750 

<212> DNA 

<213> Homo sapience 

<220> 
<221> CDS 

<222> (71),. .(6280) 
<400> 86 

ttctcagtca cgcagttcca ttttaattgc tgttaatcat ttcagagaag aacactgaac 60 
tttgaaaaaa atg ttg gaa gcc att gac aaa aat egg gec ctg cat gca 109 



Met Leu Glu Ala He Asp Lys Asn Arg Ala Leu His Ala 



10 



gca gag cgc ttg caa acc aag ctg cga gaa cgt ggg gat gta gca aat 
Ala Glu Arg Leu Gin Thr Lys Leu Arg Glu Arg Gly Asp Val Ala Asn 
15 20 25 



157 
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gaa gac aaa ctg age ctt ctg aag tea 
Glu Asp Lys Leu Ser Leu Leu Lys Ser 
30 35 

agt cag att ctg age ctt cag act tct 

Ser Gin He Leu Ser Leu Gin Thr Ser 
50 

gta aat att gca act tea gca act tea 

Val Asn He Ala Thr Ser Ala Thr Ser 

65 70 

cct cat etc age cca get gtg att cct 
Pro His Leu Ser Pro Ala Val He Pro 
80 85 

tta tta tec cca aac aat ggg aat ctg 
Leu Leu Ser Pro Asn Asn Gly Asn Leu 
95 100 

att cca cac att aat ggg aaa cct get 
He Pro His He Asn Gly Lys Pro Ala 
110 115 

ate aaa aat atg gec cag ggt cgc cat 
He Lys Asn Met Ala Gin Gly Arg His 



gtc ctg cag age cct etc ttc 205 
Val Leu Gin Ser Pro Leu Phe 
40 45 

gta cag cag ctg aaa gac cag 253 
Val Gin Gin Leu Lys Asp Gin 
55 60 

aat att gaa tat gec cac gtt 301 
Asn He Glu Tyr Ala His Val 
75 

act ctg caa aat gaa teg ttt 349 
Thr Leu Gin Asn Glu Ser Phe 
90 

gaa gca ctt aca gga cct ggt 397 
Glu Ala Leu Thr Gly Pro Gly 
105 

tgt gat gaa ttt gat cag ctt 445 
Cys Asp Glu Phe Asp Gin Leu 
120 125 

gta gaa gtt ttt gag etc etc 493 
Val Glu Val Phe Glu Leu Leu 
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130 135 140 

aaa cct cca tct gga ggc ctt ggg ttt agt gtt gtg gga eta aga agt 541 
Lys Pro Pro Ser Gly Gly Leu Gly Phe Ser Val Val Gly Leu Arg Ser 
145 150 155 

gaa aac aga gga gag ctg gga ata ttt gtt caa gag ata caa gag ggc 589 
Glu Asn Arg Gly Glu Leu Gly lie Phe Val Gin Glu He Gin Glu Gly 
160 165 170 

agt gtg gec cat aga gat gga aga ttg aaa gaa act gat caa att ctt 637 
Ser Val Ala His Arg Asp Gly Arg Leu Lys Glu Thr Asp Gin He Leu 
175 180 185 

get ate aat gga cag get ctt gat cag aca att aca cat cag cag get 685 
Ala He Asn Gly Gin Ala Leu Asp Gin Thr He Thr His Gin Gin Ala 
190 195 200 205 

ate age ate ctg cag aaa gec aaa gat act gtc cag eta gtt att gee 733 
lie Ser Illfc&ett Gin Lys Ala Lys Asp Thr Val Gin Leu Val He Ala 
210 215 220 

aga ggc tea ttg cct cag ctt gtc age ccc ata gtt tec cgt tct cca 781 
Arg Gly Ser Leu Pro Gin Leu Val Ser Pro He Val Ser Arg Ser Pro 
225 230 235 



tct gca gee age aca att tea get cac tct aat ccg gtt cac tgg caa 



829 
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Ser AU Ala Ser Thr He Ser Ala His Ser Asn Pro Val His Trp Gin 
240 245 250 

cac atg gaa acg att gaa ttg gtg aat gat gga tct ggt ttg gga ttt 877 
His Met Glu Thr He Glu Leu Val Asn Asp Gly Ser Gly Leu Gly Phe 
255 260 265 

ggc ate ata gga gga aaa gca act ggt gtg ata gta aaa acc att ctg 925 
Gly He He Gly Gly Lys Ala Thr Gly Val He Val Lys Thr He Leu 
270 275 280 285 

cct gga gga gta get gat cag cat ggg cgt tta tgc agt gga gac cac 973 
Pro Gly Gly Val Ala Asp Gin His Gly Arg Leu Cys Ser Gly Asp His 
290 295 300 

att eta aag att ggt gac aca gat eta gca gga atg age agt gag caa 1021 
lie Leu Lys lie Gly Asp Thr Asp Leu Ala Gly Met Ser Ser Glu Gin 
305 310 315 

gta gca caa gtc ctt agg caa tgt gga aat aga gtt aag ttg atg att 1069 
Val Ala Gin Val Leu Arg Gin Cys Gly Asn Arg Val Lys Leu Met He 
320 325 330 

gca aga agt gec ata gaa gaa cgt aca gca ccc act get ttg ggc ate 1117 
Ala Arg Ser Ala He Glu Glu Arg Thr Ala Pro Thr Ala Leu Gly He 
335 340 345 



134/166 



acc etc tec tea tec cca act tea acg cca gag ttg egg gtt gat get 1165 
Thr Leu Ser Ser Ser Pro Thr Ser Thr Pro Glu Leu Arg Val Asp Ala 
350 355 360 365 

tct act cag aaa ggt gaa gaa agt gag aca ttt gat gta gaa etc act 1213 
Ser Thr Gin Lys Gly Glu Glu Ser Glu Thr Phe Asp Val Glu Leu Thr 
370 375 380 

aaa aat gtc caa gga tta gga att acc att get ggc tac att gga gat 1261 
Lys Asn Val Gin Gly Leu Gly He Thr He Ala Gly Tyr He Gly Asp 
385 390 395 

aaa aaa ttg gaa cct tea gga ate ttt gta aag age att aca aaa age 1309 
Lys Lys Leu Glu Pro Ser Gly lie Phe Val Lys Ser He Thr Lys Ser 
400 405 410 

agt gec gtt gag cat gat gga aga ate caa att gga gac caa att ata 1357 
Ser Ala Val Glu His Asp Gly Arg He Gin He Gly Asp Gin He He 
415 420 425 

gca gta gat ggc aca aac ctt cag ggt ttt act aat cag caa gca gta 1405 
Ala Val Asp Gly Thr Asn Leu Gin Gly Phe Thr Asn Gin Gin Ala Val 
430 435 440 445 



gag gta ttg cga cat aca gga caa act gtg etc ctg aca eta atg agg 
Glu Val Leu Arg His Thr Gly Gin Thr Val Leu Leu Thr Leu Met Arg 
450 455 460 



1453 
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aga gga atg aag cag gaa gcc gag etc atg tea agg gaa gac gtc aca 1501 
Arg Gly Met Lys Gin Glu Ala Glu Leu Met Ser Arg Glu Asp Val Thr 
465 470 475 

aaa gat gca gat ttg tct cct gtt aat gcc age ata ate aaa gaa aat 1549 
Lys Asp Ala Asp Leu Ser Pro Val Asn Ala Ser He He Lys Glu Asn 
480 485 490 

tat gaa aaa gat gaa gat ttt tta tct teg acg aga aac ace aac ata 1597 
Tyr Glu Lys Asp Glu Asp Phe Leu Ser Ser Thr Arg Asn Thr Asn He 
495 500 505 

tta cca act gaa gaa gaa ggg tat cca tta ctg tea get gag ata gaa 1645 
Leu Pro Thr Glu Glu Glu Gly Tyr Pro Leu Leu Ser Ala Glu lie Glu 
510 515 520 525 

gaa ata gaa gat gca caa aaa caa gaa get get ctg ctg aca aaa tgg 1693 
Glu He Glu Asp Ala Gin Lys Gin Glu Ala Ala Leu Leu Thr Lys Trp 
530 535 540 

caa agg att atg gga att aac tat gaa ata gtg gtg gcc cat gtg age 1741 
Gin Arg He Met Gly He Asn Tyr Glu He Val Val Ala His Val Ser 
545 550 555 

aag ttt agt gag aac agt gga ttg ggg ata age ctg gaa gcg aca gtg 1789 
Lys Phe Ser Glu Asn Ser Gly Leu Gly He Ser Leu Glu Ala Thr Val 
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560 565 570 

gga cat cat ttt ate cga tct gtt eta cca gag ggt cct gtt gga cac 1837 

Gly His His Phe lie Arg Ser Val Leu Pro Glu Gly Pro Val Gly His 
575 580 585 

age ggg aag etc ttc agt gga gac gag eta ttg gaa gta aat ggc ata 1885 

Ser Gly Lys Leu Phe Ser Gly Asp Glu Leu Leu Glu Val Asn Gly He 

590 595 600 605 

act tta ctt ggg gaa aat cac caa gat gtg gtg aat ate tta aaa gaa 1933 

Thr Leu Leu Gly Glu Asn His Gin Asp Val Val Asn He Leu Lys Glu 

610 615 620 

ctg cct ata gaa gtg aca atg gtg tgc tgt cgt cga act gtg cca ccc 1981 

Leu Pro He Glu Val Thr Met Val Cys Cys Arg Arg Thr Val Pro Pro 

625 630 635 

acc acc caa tea gaa ttg gat age ctg gac tta tgt gat att gag eta 2029 

Thr Thr Gltt.Ser Glu Leu Asp Ser Leu Asp Leu Cys Asp lie Glu Leu 

640 645 650 

aca gaa aag cct cac gta gat eta ggt gag ttc ate ggg tea tea gag 2077 

Thr Glu Lys Pro His Val Asp Leu Gly Glu Phe He Gly Ser Ser Glu 
655 660 665 

cca gag gat cca gtg ctg gcg atg act gat gcg ggt cag agt aca gaa 2125 
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Pro Glu Asp Pro Val Leu Ala Met Thr Asp Ala Gly Gin Ser Thr Glu 
670 675 680 685 

gag gtt caa gca cct ttg gcc atg tgg gag get ggc att cag cac ata 2173 
Glu Val Gin Ala Pro Leu Ala Met Trp Glu Ala Gly He Gin His He 
690 695 700 

atg ctg gag aaa ggg age aaa gga ctt ggt ttt age att tta gat tat 2221 
Met Leu Glu Lys Gly Ser Lys Gly Leu Gly Phe Ser He Leu Asp Tyr 
705 710 715 

cag gat cca att gat cca gca age act gtg att ata att cgt tct ttg 2269 
Gin Asp Pro He Asp Pro Ala Ser Thr Val He He He Arg Ser Leu 
720 725 730 

gtg cct ggc ggc att get gaa aag gat gga cga ctt ctt cct ggt gac 2317 
Val Pro Gly Gly He Ala Glu Lys Asp Gly Arg Leu Leu Pro Gly Asp 
735 740 745 

cga etc at|||!tt gta aac gat gtt aac ttg gaa aac age agt ctt gag 2365 
Arg Leu Met Phe Val Asn Asp Val Asn Leu Glu Asn Ser Ser Leu Glu 
750 755 760 765 

gaa get gta gaa gca ctg aag gga gca ccg tea ggg act gtg aga ata 2413 
Glu Ala Val Glu Ala Leu Lys Gly Ala Pro Ser Gly Thr Val Arg He 
770 775 780 
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gga gtt get aag cct tta ccc ctt tea cca gaa gaa ggt tat gtt tct 2461 
Gly Val Ala Lys Pro Leu Pro Leu Ser Pro Glu Glu Gly Tyr Val Ser 
785 790 795 

get aag gag gat tec ttt etc tac cca cca cac tec tgt gag gaa gca 2509 
Ala Lys Glu Asp Ser Phe Leu Tyr Pro Pro His Ser Cys Glu Glu Ala 
800 805 810 

ggg ctg get gac aaa ccc etc ttc agg get gac ttg get ctg gtg ggc 2557 
Gly Leu Ala Asp Lys Pro Leu Phe Arg Ala Asp Leu Ala Leu Val Gly 
815 820 825 

aca aat gat get gac tta gta gat gaa tec aca ttt gag tct cca tac 2605 
Thr Asn Asp Ala Asp Leu Val Asp Glu Ser Thr Phe Glu Ser Pro Tyr 
830 835 840 845 

tct cct gaa aat gac age ate tac tct act caa gee tct att tta tct 2653 
Ser Pro Glu Asn Asp Ser lie Tyr Ser Thr Gin Ala Ser lie Leu Ser 
850 855 860 

ctt cat ggc agt tct tgt ggt gat ggc ctg aac tat ggt tct tec ctt 2701 
Leu His Gly Ser Ser Cys Gly Asp Gly Leu Asn Tyr Gly Ser Ser Leu 
865 870 875 

cca tea tct cct cct aag gat gtt att gaa aat tct tgt gat cca gta 2749 
Pro Ser Ser Pro Pro Lys Asp Val He Glu Asn Ser Cys Asp Pro Val 
880 885 890 
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ctt gat ctg cat atg tct ctg gag gaa eta tat acc cag aat etc ctg 2797 
Leu Asp Leu His Met Ser Leu Glu Glu Leu Tyr Thr Gin Asn Leu Leu 
895 900 905 

gaa aga cag gat gag aat aca cct teg gtg gac ata agt atg ggg cct 2845 
Glu Arg Gin Asp Glu Asn Thr Pro Ser Val Asp He Ser Met Gly Pro 
910 915 920 925 

get tct ggc ttt act ata aat gac tac aca cct gca aat get att gaa 2893 
Ala Ser Gly Phe Thr He Asn Asp Tyr Thr Pro Ala Asn Ala He Glu 
930 935 940 

caa caa tat gaa tgt gaa aac aca ata gtg tgg act gaa tct cat tta 2941 
Gin Gin Tyr Glu Cys Glu Asn Thr He Val Trp Thr Glu Ser His Leu 
945 950 955 

cca agt gaa gtt ata tea agt gca gaa ctt cct tct gtg eta ccc gat 2989 
Pro Ser Glu Val lie Ser Ser Ala Glu Leu Pro Ser Val Leu Pro Asp 
960 965 970 

tea get gga aag ggc tct gag tac ctg ctt gaa cag age tec ctg gec 3037 
Ser Ala Gly Lys Gly Ser Glu Tyr Leu Leu Glu Gin Ser Ser Leu Ala 
975 980 985 

tgt aat get gag tgt gtc atg ctt caa aat gta tct aaa gaa tct ttt 3085 
Cys Asn Ala Glu Cys Val Met Leu Gin Asn Val Ser Lys Glu Ser Phe 
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990 995 1000 1005 

gaa agg act att aat ata gca aaa ggc aat tct age eta gga atg aca 3133 
Glu Arg Thr He Asn He Ala Lys Gly Asn Ser Ser Leu Gly Met Thr 
1010 1015 1020 

gtt agt get aat aaa gat ggc ttg ggg atg ate gtt cga age att att 3181 
Val Ser Ala Asn Lys Asp Gly Leu Gly Het lie Val Arg Ser He He 
1025 1030 1035 

cat gga ggt gec att agt cga gat ggc egg att gee att ggg gac tgc 3229 
His Gly Gly Ala He Ser Arg Asp Gly Arg He Ala He Gly Asp Cys 
1040 1045 1050 

ate ttg tec att aat gaa gag tct ace ate agt gta ace aat gee cag 3277 
He Leu Ser He Asn Glu Glu Ser Thr He Ser Val Thr Asn Ala Gin 
1055 1060 1065 

gca cga get atg ttg aga aga cat tct etc att ggc cct gac ata aaa 3325 
Ala Arg Ala Met Leu Arg Arg His Ser Leu He Gly Pro Asp lie Lys 
1070 1075 1080 1085 

att act tat gtg cct gca gaa cat ttg gaa gag ttc aaa ata age ttg 3373 
He Thr Tyr Val Pro Ala Glu His Leu Glu Glu Phe Lys He Ser Leu 
1090 1095 1100 



gga caa caa tct gga aga gta atg gca ctg gat att ttt tct tea tac 



3421 
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Gly Gin Gin Ser Gly Arg Val Met Ala Leu Asp He Phe Ser Ser Tyr 

1105 1110 1115 

act ggc aga gac att cca gaa tta cca gag cga gaa gag gga gag ggt 3469 

Thr Gly Arg Asp He Pro Glu Leu Pro Glu Arg Glu Glu Gly Glu Gly 

1120 1125 1130 

gaa gaa age gaa ctt caa aac aca gca tat age aat tgg aat cag ccc 3S17 

Glu Glu Ser Glu Leu Gin Asn Thr Ala Tyr Ser Asn Trp Asn Gin Pro 

1135 1140 1145 



agg egg gtg gaa ctt tgg aga gaa cca age aaa tec tta ggc ate age 3565 
Arg Arg Val Glu Leu Trp Arg Glu Pro Ser Lys Ser Leu Gly He Ser 
1150 1155 1160 1165 



att gtt ggt gga cga ggg atg ggg agt egg eta age aat gga gaa gtg 3613 
lie Val Gly Gly Arg Gly Met Gly Ser Arg Leu Ser Asn Gly Glu Val 
1170 1175 1180 



atg agg ggc att ttc ate aaa cat gtt ctg gaa gat agg cca get ggc 3661 
Met Arg Glr He Phe He Lys His Val Leu Glu Asp Arg Pro Ala Gly 
1185 1190 1195 



aaa aat gga acc ttg aag cct gga gat aga ate gta gag gtg gat gga 3709 
Lys Asn Gly Thr Leu Lys Pro Gly Asp Arg lie Val Glu Val Asp Gly 
1200 1205 1210 
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atg gac etc aga gat gca age cat gaa caa get gtg gaa gec att egg 3757 
Met Asp Leu Arg Asp Ala Ser His Glu Gin Ala Val Glu Ala He Arg 
1215 1220 1225 

aaa gca ggc aac cct gta gtc ttt atg gta cag age att ata aac aga 3805 
Lys Ala Gly Asn Pro Val Val Phe Met Val Gin Ser He He Asn Arg 
1230 1235 1240 1245 

cca agg aaa tec cct ttg cct tec ttg ctg cac aac ctt tac cct aag 3853 
Pro Arg Lys Ser Pro Leu Pro Ser Leu Leu His Asn Leu Tyr Pro Lys 
1250 1255 1260 

tac aac ttc age age act aac cca ttt get gac tct eta caa ate aac 3901 
Tyr Asn Phe Ser Ser Thr Asn Pro Phe Ala Asp Ser Leu Gin He Asn 
1265 1270 1275 

gec gac aag gca ccc agt cag tea gag tea gag cca gag aag get cca 3949 
Ala Asp Lys Ala Pro Ser Gin Ser Glu Ser Glu Pro Glu Lys Ala Pro 
128& 1285 1290 

ttg tgc agt gtg ccc cca ccc cct cct tea gee ttt gee gaa atg ggt 3997 
Leu Cys Ser Val Pro Pro Pro Pro Pro Ser Ala Phe Ala Glu Met Gly 
1295 1300 1305 

agt gat cac aca cag tea tct gca age aaa ate tea caa gat gtg gac 4045 
Ser Asp His Thr Gin Ser Ser Ala Ser Lys He Ser Gin Asp Val Asp 
1310 1315 1320 1325 
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aaa gag gat gag ttt ggt tac age tgg aaa aat ate aga gag cgt tat 4093 
Lys Glu Asp Glu Phe Gly Tyr Ser Trp Lys Asn lie Arg Glu Arg Tyr 
1330 1335 1340 

gga ace eta aca ggc gag ctg cat atg att gaa ctg gag aaa ggt cat 4141 
Gly Thr Leu Thr Gly Glu Leu His Met He Glu Leu Glu Lys Gly His 
1345 1350 1355 

agt ggt ttg ggc eta agt ctt get ggg aac aaa gac cga tec agg atg 4189 
Ser Gly Leu Gly Leu Ser Leu Ala Gly Asn Lys Asp Arg Ser Arg Met 
1360 1365 1370 

agt gtc ttc ata gtg sgg att gat cca aat gga get gca gga aaa gat 4237 
Ser Val Phe lie Val Gly He Asp Pro Asn Gly Ala Ala Gly Lys Asp 
1375 1380 1385 

ggt cga ttg caa att gca gat gag ctt eta gag ate aat ggt cag att 4285 
Gly Arg Leu 61a He Ala Asp Glu Leu Leu Glu lie Asn Gly Gin lie 
1390 ^ 1395 1400 1405 

tta tat gga aga agt cat cag aat gec tea tea ate att aaa tgt gec 4333 
Leu Tyr Gly Arg Ser His Gin Asn Ala Ser Ser lie lie Lys Cys Ala 
1410 1415 1420 

cct tct aaa gtg aaa ata att ttt ate aga aat aaa gat gca gtg aat 4381 
Pro Ser Lys Val Lys He He Phe He Arg Asn Lys Asp Ala Val Asn 



144/166 



1425 1430 1435 

cag atg gcc gta tgt cct gga aat gca gta gaa cct ttg cct tct aac 4429 
Gin Met Ala Val Cys Pro Gly Asn Ala Val Glu Pro Leu Pro Ser Asn 
1440 1445 1450 

tea gaa aat ctt caa aat aag gag aca gag cca act gtt act act tct 4477 
Ser Glu Asn Leu Gin Asn Lys Glu Thr Glu Pro Thr Val Thr Thr Ser 
1455 1460 1465 

gat gca get gtg gac etc agt tea ttt aaa aat gtg caa cat ctg gag 4525 
Asp Ala Ala Val Asp Leu Ser Ser Phe Lys Asn Val Gin His Leu Glu 
1470 1475 1480 1485 

ctt ccc aag gat cag ggg ggt ttg ggt att get ate age gaa gaa gat 4573 
Leu Pro Lys Asp Gin Gly Gly Leu Gly He Ala He Ser Glu Glu Asp 
1490 1495 1500 

aca etc agt gga gtc ate ata aag age tta aca gag cat ggg gta gca 4621 
Thr Leu Ser Gly Val lie He Lys Ser Leu Thr Glu His Gly Val Ala 
1505 1510 1515 

gcc acg gat gga cga etc aaa gtc gga gat cag ata ctg get gta gat 4669 
Ala Thr Asp Gly Arg Leu Lys Val Gly Asp Gin lie Leu Ala Val Asp 
1520 1525 1530 



gat gaa att gtt gtt ggt tac cct att gaa aag ttt att age ctt ctg 



4717 
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Asp Glu He Val Val Gly Tyr Pro He Glu Lys Phe He Ser Leu Leu 
1535 1540 1545 

aag aca gca aag atg aca gta aaa ctt acc ate cat get gag aat cca 4765 
Lys Thr Ala Lys Met Thr Val Lys Leu Thr lie His Ala Glu Asn Pro 
1550 1555 1560 1565 

gat tec cag get gtt cct tea gca get ggt gca gec agt gga gaa aaa 4813 
Asp Ser Gin Ala Val Pro Ser Ala Ala Gly Ala Ala Ser Gly Glu Lys 
1570 1575 1580 

aag aac age tec cag tct ctg atg gtc cca cag tct ggc tec cca gaa 4861 
Lys Asn Ser Ser Gin Ser Leu Met Val Pro Gin Ser Gly Ser Pro Glu 
1585 1590 1595 

ccg gag tec ate cga aat aca age aga tea tea aca cca gca att ttt 4909 
Pro Glu Ser He Arg Asn Thr Ser Arg Ser Ser Thr Pro Ala He Phe 
1600 1605 1610 

get tct gat cct gca acc tgc ccc att ate cct ggc tgc gaa aca acc 4957 
Ala Ser Asp Pro Ala Thr Cys Pro He He Pro Gly Cys Glu Thr Thr 
1615 1620 1625 

ate gag att tec aaa ggg cga aca ggg ctg ggc ctg age ate gtt ggg 5005 
He Glu lie Ser Lys Gly Arg Thr Gly Leu Gly Leu Ser He Val Gly 
16 30 1635 1640 1645 



146/166 



ggt tea gac acg ctg ctg ggt gec ttt att ate cat gaa gtt tat gaa 
Gly Ser Asp Thr Leu Leu Gly Ala Phe He He His Glu Val Tyr Glu 
1650 1655 1660 



5053 



gaa gga gca gca tgt aaa gat gga aga etc tgg get gga gat cag ate 
Glu Gly Ala Ala Cys Lys Asp Gly Arg Leu Trp Ala Gly Asp Gin lie 
1665 1670 1675 



5101 



tta gag gtg aat gga att gac ttg agg aag gee aca cat gat gaa gca 
Leu Glu Val Asn Gly He Asp Leu Arg Lys Ala Thr His Asp Glu Ala 
1680 1685 1690 



5149 



ate aat gtc ctg aga cag acg cca cag aga gtg cgc ctg aca etc tac 
He Asn Val Leu Arg Gin Thr Pro Gin Arg Val Arg Leu Thr Leu Tyr 
1695 1700 1705 



5197 



aga gat gag gec cca tac aaa gag gag gaa gtg tgt gac ace etc act 
Arg Asp Glu Ala Pro Tyr Lys Glu Glu Glu Val Cys Asp Thr Leu Thr 
1710 1715 1720 - 1725 



5245 



att gag ctg cag aag aag ccg gga aaa ggc eta gga tta agt att gtt 
He Glu Leu Gin Lys Lys Pro Gly Lys Gly Leu Gly Leu Ser He Val 
1730 1735 1740 



5293 



ggt aaa aga aac gat act gga gta ttt gtg tea gac 
Gly Lys Arg Asn Asp Thr Gly Val Phe Val Ser Asp 
1745 1750 



att gtc aaa gga 
He Val Lys Gly 
1755 



5341 
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gga att gca gat ccc gat gga aga ctg ate cag gga gac cag ata tta 5389 
Gly He Ala Asp Pro Asp Gly Arg Leu He Gin Gly Asp Gin He Leu 
1760 1765 1770 

ttg gtg aat ggg gaa gac gtt cgt aat gec tec caa gaa gcg gtt gec 5437 
Leu Val Asn Gly Glu Asp Val Arg Asn Ala Ser Gin Glu Ala Val Ala 
1775 1780 1785 

get ttg eta aag tgt tec eta ggc aca gta acc ttg gaa gtt gga aga 5485 
Ala Leu Leu Lys Cys Ser Leu Gly Thr Val Thr Leu Glu Val Gly Arg 
1790 1795 1800 1805 

ate aaa get ggt cca ttc cat tea gag agg agg cca tct caa acc age 5533 
He Lys Ala Gly Pro Phe His Ser Glu Arg Arg Pro Ser Gin Thr Ser 
1810 1815 1820 

cag gtg agt gaa ggc age ctg tct tct ttc act ttt cca etc tct gga 5581 
Gin Val Ser Glu Gly Ser Leu Ser Ser Phe Thr Phe Pro Leu Ser Gly 
v 1825 1830 1835 

tec agt aca tct gag tea ctg gaa agt age tea aag aag aat gca ttg 5629 
Ser Ser Thr Ser Glu Ser Leu Glu Ser Ser Ser Lys Lys Asn Ala Leu 
1840 1845 1850 

gca tct gaa ata cag^ gga tta aga aca gtc gaa atg aaa aag ggc cct 5677 
Ala Ser Glu He Gin Gly Leu Arg Thr Val Glu Met Lys Lys Gly Pro 
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1855 i860 1865 

act gac tea ctg gga ate age ate get gga gga gta ggc age cca ctt 5725 
Thr Asp Ser Leu Gly He Ser lie Ala Gly Gly Val Gly Ser Pro Leu 
1870 1875 1880 1885 

ggt gat gtg cct ata ttt att gca atg atg cac cca act gga gtt gca 5773 
Gly Asp Val Pro He Phe lie Ala Met Met His Pro Thr Gly Val Ala 
1890 1895 1900 

gca cag acc caa aaa etc aga gtt ggg gat agg att gtc acc ate tgt 5821 
Ala Gin Thr Gin Lys Leu Arg Val Gly Asp Arg He Val Thr He Cys 
1905 1910 1915 

ggc aca tec act gag ggc atg act cac acc caa gca gtt aac eta ctg 5869 
Gly Thr Ser Thr Glu Gly Met Thr His Thr Gin Ala Val Asn Leu Leu 
1920 1925 1930 

aaa aat gca tct ggc tec att gaa atg cag gtg gtt get gga gga gac 5917 
Lys Asn Al* Ser Gly Ser He Glu Met Gin Val Val Ala Gly Gly Asp 
1935 1940 1945 

gtg agt gtg gtc aca ggt cat cat cag gag cct gca agt tec agt ctt 5965 
Val Ser Val Val Thr Gly His His Gin Glu Pro Ala Ser Ser Ser Leu 
1950 1955 I960 1965 



tct ttc act ggg ctg acg tea acc agt ata ttt cag gat gat tta gga 



6013 
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Ser Phe Thr Gly Leu Thr Ser Thr Ser lie Phe Gin Asp Asp Leu Gly 
1970 1975 1980 

cct cct caa tgt aag tct att aca eta gag cga gga cca gat ggc tta 6061 
Pro Pro Gin Cys Lys Ser He Thr Leu Glu Arg Gly Pro Asp Gly Leu 
1985 1990 1995 

ggc ttc agt ata gtt gga gga tat ggc age cct cat gga gac tta ccc 6109 
Gly Phe Ser He Val Gly Gly Tyr Gly Ser Pro His Gly Asp Leu Pro 
2000 2005 2010 

att tat gtt aaa aca gtg ttt gca aag gga gca gec tct gaa gac gga 6157 
He Tyr Val Lys Thr Val Phe Ala Lys Gly Ala Ala Ser Glu Asp Gly 
2015 2020 2025 

cgt ctg aaa agg ggc gat cag ate att get gtc aat ggg cag agt eta 6205 
Arg Leu Lys Arg Gly Asp Gin He He Ala Val Asn Gly Gin Ser Leu 
2030 2035 2040 2045 

gaa gga gtc acc cat gaa gaa get gtt gec ate ctt aaa egg aca aaa 6253 
Glu Gly Val Thr His Glu Glu Ala Val Ala He Leu Lys Arg Thr Lys 
2050 2055 2060 

ggc act gtc act ttg atg gtt etc tct tgaattggct gecagaattg 6300 
Gly Thr Val Thr Leu Met Val Leu Ser 
2065 2070 
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aaccaaccca acccctagct cacctcctac tgtaaagaga atgcactggt cctgacaatt 6360 

tttatgctgt gttcagccgg gtcttcaaaa ctgtaggggg gaaataacac ttaagtttct 6420 

ttttctcatc tagaaatgct ttccttactg acaacctaac atcatttttc ttttcttctt 6480 

gcattttgtg aacttaaaga gaaggaatat ttgtgtaggt gaatctcgtt tttatttgtg 6540 

gagatatcta atgttttgta gtcacatggg caagaattat tacatgctaa gctggttagt 6600 

ataaagaaag ataattctaa agctaaccaa agaaaatggc ttcagtaagt taggatgaaa 6660 

aatgaaaata taaaataaag aagaaaatct cggggagttt aaaaaaaatg cctcaatttg 6720 

gcaatctacc tcctctcccc accccaaact 6750 

<210> 87 

<211> 6666 

<212> DNA 

< 2 1 3 > Homo sapience 

<220> 
<221> CDS 

<222> (71).. .(3787) 
<400> 87 

ttctcagtca cgcagttcca ttttaattgc tgttaatcat ttcagagaag aacactgaac 60 



151/166 



tttgaaaaaa atg ttg gaa gcc att gac aaa aat egg gec ctg cat gca 109 
Met Leu Glu Ala He Asp Lys Asn Arg Ala Leu His Ala 
1 5 10 

gca gag cgc ttg caa acc aag ctg cga gaa cgt ggg gat gta gca aat 157 

Ala Glu Arg Leu Gin Thr Lys Leu Arg Glu Arg Gly Asp Val Ala Asn 
15 20 25 

gaa gac aaa ctg age ctt ctg aag tea gtc ctg cag age cct etc ttc 206 

Glu Asp Lys Leu Ser Leu Leu Lys Ser Val Leu Gin Ser Pro Leu Phe 

30 35 40 45 

agt cag att ctg age ctt cag act tct gta cag cag ctg aaa gac cag 253 

Ser Gin lie Leu Ser Leu Gin Thr Ser Val Gin Gin Leu Lys Asp Gin 

50 55 60 

gta aat att gca act tea gca act tea aat att gaa tat gcc cac gtt 301 

Val Asn lie Ala Thr Ser Ala Thr Ser Asn lie Glu Tyr Ala His Val 

j|\65 70 75 

cct cat etc age cca get gtg att cct act ctg caa aat gaa teg ttt 349 

Pro His Leu Ser Pro Ala Val He Pro Thr Leu Gin Asn Glu Ser Phe 
80 85 90 



tta tta tec cca aac aat ggg aat ctg gaa gca ctt aca gga cct ggt 
Leu Leu Ser Pro Asn Asn Gly Asn Leu Glu Ala Leu Thr Gly Pro Gly 



397 
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95 100 

att cca cac att aat ggg aaa cct get 
lie Pro His He Asn Gly Lys Pro Ala 
110 115 

ate aaa aat atg gee cag ggt cgc cat 
He Lys Asn Met Ala Gin Gly Arg His 
130 

aaa cct cca tct gga ggc ctt ggg ttt 
Lys Pro Pro Ser Gly Gly Leu Gly Phe 
145 150 

gaa aac aga gga gag ctg gga ata ttt 
Glu Asn Arg Gly Glu Leu Gly lie Phe 
160 165 

agt gtg gee cat aga gat gga aga ttg 
Ser Val A^^ft Arg Asp Gly Arg Leu 
175 180 

get ate aat gga cag get ctt gat cag 
Ala lie Asn Gly Gin Ala Leu Asp Gin 
190 195 

ate age ate ctg cag aaa gee aaa gat 



105 

tgt gat gaa ttt gat cag ctt 445 
Cys Asp Glu Phe Asp Gin Leu 
120 125 

gta gaa gtt ttt gag etc etc 493 
Val Glu Val Phe Glu Leu Leu 
135 140 

agt gtt gtg gga eta aga agt 541 
Ser Val Val Gly Leu Arg Ser 
155 

gtt caa gag ata caa gag ggc 589 
Val Gin Glu He Gin Glu Gly 
170 

aaa gaa act gat caa att ctt 637 
Lys Glu Thr Asp Gin He Leu 
185 

aca att aca cat cag cag get 685 
Thr He Thr His Gin Gin Ala 
200 205 

act gtc cag eta gtt att gec 733 
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He Ser He Leu Gin Lys Ala Lys Asp Thr Val Gin Leu Val lie Ala 
210 215 220 



aga ggc tea ttg cct cag ctt gtc age ccc ata gtt tec cgt tct cca 
Arg Gly Ser Leu Pro Gin Leu Val Ser Pro lie Val Ser Arg Ser Pro 
225 230 235 



781 



tct gca gee age aca att tea get cac tct aat ccg gtt cac tgg caa 
Ser Ala Ala Ser Thr He Ser Ala His Ser Asn Pro Val His Trp Gin 
240 245 250 



829 



cac atg gaa acg att gaa ttg gtg aat gat gga tct ggt ttg gga ttt 
His Met Glu Thr He Glu Leu Val Asn Asp Gly Ser Gly Leu Gly Phe 
255 260 265 



877 



ggc ate ata gga gga aaa gca act ggt gtg ata gta aaa acc att ctg 
Gly He He Gly Gly Lys Ala Thr Gly Val He Val Lys Thr He Leu 
270 275 280 285 



925 



cct gga gg^jfta get gat cag cat ggg cgt tta tgc agt gga gac cac 



Pro Gly Gly Val Ala Asp Gin His Gly Arg Leu Cys Ser Gly Asp His 
290 295 300 



973 



att eta aag att ggt gac aca gat eta gca gga atg age agt gag caa 
He Leu Lys He Gly Asp Thr Asp Leu Ala Gly Met Ser Ser Glu Gin 
305 310 315 



1021 
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gta gca caa gtc ctt agg caa tgt gga aat aga gtt aag ttg atg att 1069 
Val Ala Gin Val Leu Arg Gin Cys Gly Asn Arg Val Lys Leu Met He 
320 325 330 

gca aga agt gcc ata gaa gaa cgt aca gca ccc act get ttg ggc ate 1117 
Ala Arg Ser Ala He Glu Glu Arg Thr Ala Pro Thr Ala Leu Gly He 
335 340 345 

ace etc tec tea tec cca act tea acg cca gag ttg egg gtt gat get 1165 
Thr Leu Ser Ser Ser Pro Thr Ser Thr Pro Glu Leu Arg Val Asp Ala 
350 355 360 365 

tct act cag aaa ggt gaa gaa agt gag aca ttt gat gta gaa etc act 1213 
Ser Thr Gin Lys Gly Glu Glu Ser Glu Thr Phe Asp Val Glu Leu Thr 
370 375 380 

aaa aat gtc caa gga tta gga att ace att get ggc tac att gga gat 1261 
Lys Asn Val Gin Gly Leu Gly He Thr He Ala Gly Tyr He Gly Asp 
385 390 395 

aaa aaa ttg gaa cct tea gga ate ttt gta aag age att aca aaa age 1309 
Lys Lys Leu Glu Pro Ser Gly He Phe Val Lys Ser He Thr Lys Ser 
400 405 410 

agt gcc gtt gag cat gat gga aga ate caa att gga gac caa att ata 1357 
Ser Ala Val Glu His Asp Gly Arg He Gin He Gly Asp Gin He He 
415 420' 425 
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gca gta gat ggc aca aac ctt cag ggt 
Ala Val Asp Sly Thr Asn Leu Gin Gly 
430 435 

gag gta ttg cga cat aca gga caa act 
Glu Val Leu Arg His Thr Gly Gin Thr 
450 

aga gga atg aag cag gaa gcc gag etc 
Arg Gly Met Lys Gin Glu Ala Glu Leu 
465 470 

aaa gat gca gat ttg tct cct gtt aat 
Lys Asp Ala Asp Leu Ser Pro Val Asn 
480 485 

tat gaa aaa gat gaa gat ttt tta tct 
Tyr Glu Lys Asp Glu Asp Phe Leu Ser 
495 500 

tta cca act gaa gaa gaa ggg tat cca 
Leu Pro Thr Glu Glu Glu Gly Tyr Pro 
510 515 

gaa ata gaa gat gca caa aaa caa gaa 
Glu lie Glu Asp Ala Gin Lys Gin Glu 



ttt act aat cag caa gca gta 1405 
Phe Thr Asn Gin Gin Ala Val 
440 445 

gtg etc ctg aca eta atg agg 1453 
Val Leu Leu Thr Leu Met Arg 
455 460 

atg tea agg gaa gac gtc aca 1501 
Met Ser Arg Glu Asp Val Thr 
475 

gcc age ata ate aaa gaa aat 1549 
Ala Ser He lie Lys Glu Asn 
490 

teg acg aga aac acc aac ata 1597 
Ser Thr Arg Asn Thr Asn He 
505 

tta ctg tea get gag ata gaa 1645 
Leu Leu Ser Ala Glu lie Glu 
520 525 

get get ctg ctg aca aaa tgg 1693 
Ala Ala Leu Leu Thr Lys Trp 
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530 535 540 

caa agg att atg gga att aac tat gaa ata gtg gtg gcc cat gtg age 1741 
Gin Arg He Met Gly He Asn Tyr Glu He Val Val Ala His Val Ser 
545 550 555 

aag ttt agt gag aac agt gga ttg ggg ata age ctg gaa gcg aca gtg 1789 
Lys Phe Ser Glu Asn Ser Gly Leu Gly He Ser Leu Glu Ala Thr Val 
560 565 570 

gga cat cat ttt ate cga tct gtt eta cca gag ggt cct gtt gga cac 1837 
Gly His His Phe He Arg Ser Val Leu Pro Glu Gly Pro Val Gly His 
575 580 585 

age ggg aag etc ttc agt gga gac gag eta ttg gaa gta aat ggc ata 1885 
Ser Gly Lys Leu Phe Ser Gly Asp Glu Leu Leu Glu Val Asn Gly lie 
590 595 600 605 

act tta ctt ggg gaa aat cac caa gat gtg gtg aat ate tta aaa gaa 1933 
Thr Leu I0§ht Glu Asn His Gin Asp Val Val Asn lie Leu Lys Glu 
610 615 620 

ctg cct ata gaa gtg aca atg gtg tgc tgt cgt cga act gtg cca ccc 1981 
Leu Pro lie Glu Val Thr Met Val Cys Cys Arg Arg Thr Val Pro Pro 
625 630 635 

ace acc caa tea gaa ttg gat age ctg gac tta tgt gat att gag eta 2029 
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Thr Thr Gin Ser Glu Leu Asp Ser Leu Asp Leu Cys Asp lie Glu Leu 
640 645 650 



aca gaa aag cct cac gta gat eta ggt gag ttc ate ggg tea tea gag 
Thr Glu Lys Pro His Val Asp Leu Gly Glu Phe He Gly Ser Ser Glu 
655 660 665 



2077 



cca gag gat cca gtg ctg gcg atg act gat gcg ggt cag agt aca gaa 
Pro Glu Asp Pro Val Leu Ala Met Thr Asp Ala Gly Gin Ser Thr Glu 
670 675 680 685 



2125 



gag gtt caa gca cct ttg gee atg tgg gag get ggc att cag cac ata 
Glu Val Gin Ala Pro Leu Ala Met Trp Glu Ala Gly He Gin His He 
690 695 700 



2173 



atg ctg gag aaa ggg age aaa gga ctt ggt ttt age att tta gat tat 
Met Leu Glu Lys Gly Ser Lys Gly Leu Gly Phe Ser He Leu Asp Tyr 
705 710 715 



2221 



cag gat d^ffi|tt. gat cca gca age act gtg att ata att cgt tct ttg 
Gin Asp Ww$X* Asp Pro Ala Ser Thr Val He He He Arg Ser Leu 
720 725 730 



2269 



gtg cct ggc ggc 
Val Pro Gly Gly 
735 



att get gaa aag gat gga cga ctt ctt cct 
lie Ala Glu Lys Asp Gly Arg Leu Leu Pro 
740 745 



ggt gac 
Gly Asp 



2317 
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cga etc atg ttt gta aac gat gtt aac ttg gaa aac age agt ctt gag 
Arg Leu Met Phe Val Asn Asp Val Asn Leu Glu Asn Ser Ser Leu Glu 
750 755 760 765 



2365 



gaa get gta gaa gca ctg aag gga gca ccg tea ggg act gtg aga ata 
Glu Ala Val Glu Ala Leu Lys Gly Ala Pro Ser Gly Thr Val Arg He 
770 775 780 



2413 



gga gtt get aag cct tta ccc ctt tea cca gaa gaa ggt tat gtt tct 
Gly Val Ala Lys Pro Leu Pro Leu Ser Pro Glu Glu Gly Tyr Val Ser 
785 790 795 



2461 



get aag gag gat tec ttt etc tac cca cca cac tec tgt gag gaa gca 
Ala Lys Glu Asp Ser Phe Leu Tyr Pro Pro His Ser Cys Glu Glu Ala 
800 805 810 



2509 



ggg ctg get gac aaa ccc etc ttc agg get gac ttg get ctg gtg ggc 
Gly Leu Ala Asp Lys Pro Leu Phe Arg Ala Asp Leu Ala Leu Val Gly 
815 820 825 



aca aat gat get gac tta gta gat gaa tec aca ttt gag tct cca tac 
Thr Asn Asp Ala Asp Leu Val Asp Glu Ser Thr Phe Glu Ser Pro Tyr 
830 835 840 845 



2557 



2605 



tct cct gaa aat gac age ate tac tct act caa gee tct att tta tct 
Ser Pro Glu Asn Asp Ser He Tyr Ser Thr Gin Ala Ser He Leu Ser 
850 855 860 



2653 
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ctt cat ggc agt tct tgt ggt gat ggc ctg aac tat ggt tct tec ctt 
Leu His Gly Ser Ser Cys Gly Asp Gly Leu Asn Tyr Gly Ser Ser Leu 
865 870 875 



2701 



cca tea tct cct cct aag gat gtt att gaa aat tct tgt gat cca gta 
Pro Ser Ser Pro Pro Lys Asp Val lie Glu Asn Ser Cys Asp Pro Val 
880 885 890 



2749 



ctt gat ctg cat atg tct ctg gag gaa eta tat acc cag aat etc ctg 
Leu Asp Leu His Met Ser Leu Glu Glu Leu Tyr Thr Clin Asn Leu Leu 
895 900 905 



2797 



gaa aga cag gat gag aat aca cct teg gtg gac ata agt atg ggg cct 
Glu Arg Gin Asp Glu Asn Thr Pro Ser Val Asp lie Ser Met Gly Pro 
910 915 920 925 



2845 



get tct ggc ttt act ata aat gac tac aca cct gca aat get att gaa 
Ala Ser Gly Phe Thr He Asn Asp Tyr Thr Pro Ala Asn Ala He Glu 
93a 935 940 



2893 



caa caa tat gaa tgt gaa aac aca ata gtg tgg act gaa tct cat tta 
Gin Gin Tyr Glu Cys Glu Asn Thr He Val Trp Thr Glu Ser His Leu 
945 950 955 



2941 



cca agt gaa gtt ata tea agt gca gaa ctt cct tct gtg eta ccc gat 2989 
Pro Ser Glu Val He Ser Ser Ala Glu Leu Pro Ser Val Leu Pro Asp 



i 
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960 965 970 

tea get gga aag ggc tct gag cac ctg ctt gaa cag age tec ctg gec 3037 
Ser Ala Gly Lys Gly Ser Glu His Leu Leu Glu Gin Ser Ser Leu Ala 
975 980 985 

tgt aat get gag tgt gtc atg ctt caa aat gta tct aaa gaa tct ttt 3085 
Cys Asn Ala Glu Cys Val Met Leu Gin Asn Val Sep Lys Glu Ser Phe 
990 995 1000 1005 

gaa agg act att aat ata gca aaa ggc aat tct age eta gga atg aca 3133 
Glu Arg Thr He Asn He Ala Lys Gly Asn Ser Ser Leu Gly Met Thr 
1010 1015 1020 

gtt agt get aat aaa gat ggc ttg ggg atg ate gtt cga age att att 3181 
Val Ser Ala Asn Lys Asp Gly Leu Gly Met He Val Arg Ser He lie 
1025 1030 1035 

cat gga ggt gec att agt cga gat ggc egg att gec att ggg gac tgc 3229 
His Gly Gl^ Ala lie Ser Arg Asp Gly Arg He Ala He Gly Asp Cys 
1040 1045 1050 

ate ttg tec att aat gaa gag tct acc ate agt gta acc aat gec cag 3277 
He Leu Ser He Asn Glu Glu Ser Thr He Ser Val Thr Asn Ala Gin 
1055 1060 1065 



gca cga get atg ttg aga aga cat tct etc att ggc cct gac ata aaa 



3325 
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Ala Arg Ala Met Leu Arg Arg His Ser Leu lie Gly Pro Asp lie Lys 
1070 1075 1080 1085 

att act tat gtg cct gca gaa cat ttg gaa gag ttc aaa ata age ttg 3373 
He Thr Tyr Val Pro Ala Glu His Leu Glu Glu Phe Lys He Ser Leu 
1090 1095 1100 

gga caa caa tct gga aga gta atg gca ctg gat att ttt tct tea tac 3421 
Gly Gin Gin Ser Gly Arg Val Met Ala Leu Asp He Phe Ser Ser Tyr 
1105 1110 1115 

act ggc aga gac att cca gaa tta cca gag cga gaa gag gga gag ggt 3469 
Thr Gly Arg Asp He Pro Glu Leu Pro Glu Arg Glu Glu Gly Glu Gly 
H20 1125 1130 

gaa gaa age gaa ctt caa aac aca gca tat age aat tgg aat cag ccc 3517 
Glu Glu Ser Glu Leu Gin Asn Thr Ala Tyr Ser Asn Trp Asn Gin Pro 
1135 1140 1145 

agg egg gtg, gaa etc tgg aga gaa cca age aaa tec tta ggc ate age 3565 
Arg Arg Val Glu Leu Trp Arg Glu Pro Ser Lys Ser Leu Gly He Ser 
1150 1155 1160 1165 

att gtt ggt gga cga ggg atg ggg agt egg eta age aat gga gaa gtg 3613 
He Val Gly Gly Arg Gly Met Gly Ser Arg Leu Ser Asn Gly Glu Val 
1170 1175 1180 



162/166 



atg agg ggc att ttc ate aaa cat gtt ctg gaa gat agt cca get ggc 3661 
Met Arg Gly lie Phe He Lys His Val Leu Glu Asp Ser Pro Ala Gly 
1185 1190 1195 

aaa aat gga acc ttg aaa cct gga gat aga ate gta gag gtg gat gga 3709 
Lys Asn Gly Thr Leu Lys Pro Gly Asp Arg He Val Glu Val Asp Gly 
1200 1205 1210 

atg gac etc aga gat gca age cat gaa caa get gtg gaa gee att egg 3757 
Met Asp Leu Arg Asp Ala Ser His Glu Gin Ala Val Glu Ala He Arg 
1215 1220 1225 

aaa gca ggc aac cct gta gtc ttt atg gta tagagcttta ttacagacca 3807 
Lys Ala Gly Asn Pro Val Val Phe Met Val 
1230 1235 

agggcaccca gtcagtcagt cagagtcaga gtcagagtca gagecagaga aggctccatt 3867 

gtgcagtgtg cccccacccc ctccttcagc etttgecgaa atgggtagtg atcacacaca 3927 

gtcatctgea agcaaaatct cacaagatgt ggacaaagag gatgagtttg gttacagctg 3987 

gaaaaatatc agagagegtt atggaaccct aacaggegag ctgcatatga ttgaactgga 4047 

gaaaggtcat agtggtttgg gectaagtet tgctgggaac aaagaccgat ccaggatgag 4107 



tgtcttcata gtggggattg atccaaatgg agetgeagga aaagatggtc gattgeaaat 4167 
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tgcagatgag cttctagaga tcaatggtca gattttatat ggaagaagtc atcagaatgc 4227 

ctcatcaatc attaaatgtg ccccttctaa agtgaaaata atttttatca gaaataaaga 4287 

tgcagtgaat cagatggccg tatgtcctgg aaatgcagta gaacctttgc cttctaactc 4347 

agaaaatctt caaaataagg agacagagcc aactgttact acttctgatg cagctgtgga 4407 

cctcagttca tttaaaaatg tgcaacatct ggagcttccc aaggatcagg ggggtttggg 4467 

tattgctatc agcgaagaag atacactcag tggagtcatc ataaagagct taacagagca 4527 

tggggtagca gccacggatg gacgactcaa agtcggagat cagatactgg ctgtagatga 4587 

tgaaattgtt gttggttacc ctattgaaaa gtttattagc cttctgaaga cagcaaagat 4647 

gacagtaaaa cttaccatcc atgctgagaa tccagattcc caggctgttc cttcagcagc 4707 

tggtgcagcc agtggagaaa aaaagaacag ctcccagtct ctgatggtcc cacagtctgg 4767 

ctccccagaa ccggagtcca tccgaaatac aagcagatca tcaacaccag caatttttgc 4827 

ttctgatcct gcaacctgcc ccattatccc tggctgcgaa acaaccatcg agatttccaa 4887 

agggcgaaca gggctgggcc tgagcatcgt tgggggttca gacacgctgc tgggtgcctt 4947 
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tattatccat gaagtttatg aagaaggagc agcatgtaaa gatggaagac tctgggctgg 5007 

agatcagatc ttagaggtga atggaattga cttgaggaag gccacacatg atgaagcaat 5067 

caatgtcctg agacagacgc cacagagagt gcgcctgaca ctctacagag atgaggcccc 5127 

atacaaagag gaggaagtgt gtgacaccct cactattgag ctgcagaaga agccgggaaa 5187 

aggcctagga ttaagtattg ttggtaaaag aaacgatact ggagtatttg tgtcagacat 5247 

tgtcaaagga ggaattgcag atcccgatgg aagactgatc cagggagacc agatattatt 5307 

ggtgaatggg gaagacgttc gtaatgcctc ccaagaagcg gttgccgctt tgctaaagtg 5367 

ttccctaggc acagtaacct tggaagttgg aagaatcaaa gctggtccat tccattcaga 5427 

gaggaggcca tctcaaacca gccaggtgag tgaaggcagc ctgtcttctt tcacttttcc 5487 

actctctgga tccagtacat ctgagtcact ggaaagtagc tcaaagaaga atgcattggc 5547 

atctgaaata cagggattaa gaacagtcga aatgaaaaag ggccctactg actcactggg 5607 

aatcagcatc gctggaggag taggcagccc acttggtgat gtgcctatat ttattgcaat 5667 

gatgcaccca actggagttg cagcacagac ccaaaaactc agagttgggg ataggattgt 5727 

caccatctgt ggcacatcca ctgagggcat gactcacacc caagcagtta acctactgaa 5787 
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aaatgcatct ggctccattg aaatgcaggt ggttgctgga ggagacgtga gtgtggtcac 5847 

aggtcatcat caggagcctg caagttccag tctttctttc actgggctga cgtcaaccag 5907 

tatatttcag gatgatttag gacctcctca atgtaagtct attacactag agcgaggacc 5967 

agatggctta ggcttcagta tagttggagg atatggcagc cctcatggag acttacccat 6027 

ttatgttaaa acagtgtttg caaagggagc agcctctgaa gacggacgtc tgaaaagggg 6087 

cgatcagatc attgctgtca atgggcagag tctagaagga gtcacccatg aagaagctgt 6147 

tgccatcctt aaacggacaa aaggcactgt cactttgatg gttctctctt gaattggctg 6207 

ccagaattga accaacccaa cccctagctc acctcctact gtaaagagaa tgcactggtc 6267 

ctgacaattt ttatgctgtg ttcagccggg tcttcaaaac tgtagggggg aaataacact 6327 

taagtttctt tttctcatct agaaatgctt tccttactga caacctaaca tcatttttct 6387 

tttcttcttg cattttgtga acttaaagag aaggaatatt tgtgtaggtg aatctcgttt 6447 

ttatttgtgg agatatctaa tgttttgtag tcacatgggc aagaattatt acatgctaag 6507 

ctggttagta taaagaaaga taattctaaa gctaaccaaa gaaaatggct tcagtaagtt 6567 
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ggatgaaaa atgaaaatat aaaataaaga agaaaatctc ggggagttta aaaaaaatgc 6627 



tcaatttgg caatctacct cctctcccca ccccaaact 6666 



